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Abstract 


This work presents the search for the semileptonic baryonic B decay B~ —> A^p^~Vi .The 
used data comprises the complete BaBAR data set of 470 • 10 6 BB events, collected at the 
SLAC National Accelerator Laboratory. Using a pole-model decay simulation, we obtain 
upper limits with 90% confidence level of 


B(B -a A+pe v t 
B(B~ -a 

B{B~ -A A+pru, 


B(A+ -A pK 7T + ) 

5% 

B(A+ -a pK~n + ) 
5% 

B{A+ -a pK~ 7r + ) 
5% 


< 1.2 x 10' 4 , 

< 2.5 x 10" 4 , 

< 1.0 x 10” 4 . 


These results are in slight tension with predictions based on the measurement of B -A 

ppt~Vi and B~ -A A+pir-. 


Kurzfassung 

In der vorliegenden Arbeit wird die Sue he nach dem semi-leptonischen, baryonischen B 
Zerfall B~ -A Ajpi^ue prasentiert. Die genutzten Daten entsprechen dem kompletten 
BaBAR Datensatz von 470 • 10 6 BB Paaren, welcher am SLAC National Accelerator Labo¬ 
ratory gesammelt wurde. Mittels einer Zerfallssimulation auf Grundlage eines Polmodels 
erhalten wir Limits mit 90% Vertrauensniveau von 


B(B -a A+pe v ( 
B(B~ -A A+pp~V h 
B(B~ -a /1+pf A 


B(A+ -a pK 7r + ) 

5% 

B{A+ -a pK~ 7r + ) 
5% 

B(A+ -a pK~n + ) 
5% 


< 1.2 x 10~ 4 , 

< 2.5 x 10” 4 , 

< 1.0 x 10~ 4 . 


Diese Ergebnisse stehen in leichtem Widerspruch zu Vorhersagen basierend auf der Mes- 
sung der Zerfalle B~ -A ppl~V( und B~ -A A^pn~ . 
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1. Introduction 


In 1973 Kobayashi and Maskawa postulated a third quark family, consisting of the top and 
bottom quark. These new quarks were needed to incorporate CP-violation into the elec- 
troweak Standard Model framework. The first measurement of a bound bb state took place 
in 1977, when the Columbia-Fermilab-Stony Brook collaboration at Fermilab discovered 
the T(IS') resonance. After further data taking, they were able to obtain evidence for two 
additional resonances, the T(2S) and the T(3S) in the process p-\-nucleus —>• + X. 

Later on these results were confirmed by e + e~ annihilation experiments at DORIS (stor¬ 
age ring at DESY, Hamburg, Germany) and at CESR (storage ring at Cornell University, 
USA). In addition, experiments at DORIS and CESR were able to establish the T as a 
bound state of a b quark, with charge — and its anti-particle. 

In order to estimate further quantum numbers the production of hadrons with b-flavor 
(e.g. 5-mesons: 5 = bq) was required. Theoretical models proposed that the yet unseen 
T(AS) and higher resonances should have a mass above the BB production threshold, 
which would enable the production of BB pairs. The CLEO and CUSB experiments at 
CESR observed these higher resonances in 1980. In the next decades experiments like 
CLEO and ARGUS (Detector at the DORIS storage ring) collected data on the decay of 
5-mesons. Therefore, 5-mesons were produced in the reaction e + e _ —> T(AS) —> 55. 
These experiments opened a new field of research known as 5-physics, providing a test¬ 
ing ground for the Standard Model. This new research field enabled physicists to confirm 
parts of the Standard Model with high precision. In addition, 5-meson decays are a sen¬ 
sitive probe for physics beyond the Standard Model, and allow us to test new theories. 

In the late 90s of the 20th century the 5-Factories BaBAR (Detector at SLAC, USA) and 
Belle (Detector at KEK, Japan) started data taking and have collected large data samples 
ever since. These data sets enable studies of rare 5-decays. 

A substantial fraction of all 5-decays produce baryons in the final state, which was con¬ 
firmed by the ARGUS collaboration in 1992. They measured the inclusive branching 
fraction of 5-mesons into baryons flU to be 


5(5 ->■ baryons + X) = (6.8 ± 0.5 sta t ± 0.3 syst )%. (1.1) 

Up to now little is known about the underlying production mechanisms of baryons in 
weak 5-decays. One of the major drawbacks is that for most baryonic 5-decays sev- 
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eral decay mechanisms have to be taken into account without knowledge of their impact 
on the total branching fraction. Since theoretical predictions are rare, due to the non- 
perturbative nature of the describing quantum field theory (quantum chromodynamics, 
QCD), it is necessary to measure 5-decays proceeding exclusively via one mechanism. 
This requirement is met by the semileptonic 5-decay B~ —> A^p£~Vi investigated in the 
present work. 

1.1 The Standard Model 

The Standard Model of Particle Physics was developed in the 20th century and has be¬ 
come one of the most successful theories in physics. According to the Standard Model all 
matter in the universe consists of a few fundamental particles, the leptons and the quarks. 
While leptons can occur isolated, quarks can only exist in bound states, the mesons and 
baryons. Moreover, three of the four fundamental interactions can be described by the 
standard model. The interactions incorporated into the theoretical framework of the Stan¬ 
dard Model are the electromagnetic, the weak and the strong interaction. Up to now it 
is not possible to describe gravity in terms of the Standard Model since a quantum field 
theory of gravity is still missing. 

In the next sections the Review of Particle Physics 2012 Q was taken as reference for 
particle properties unless stated differently. 

1.1.1 The Fundamental Particles 

In the Standard Model two groups of fundamental particles are used to describe the visible 
matter in the universe: the quarks and the leptons. Both groups consist of fermions, 
particles with spin 1/2. The next sections give an overview of the basic properties of 
these two groups. Here, we follow the description given in my diploma thesis ifTOl . 

1.1.1.1 The Leptons 

The first member of this group of particles, the electron, was discovered in the year 1897 
by J.J. Thomson. The muon followed in 1937, and the the tauon r, as last charged member 
of this group, was discovered in 1975. The neutral partner of these charged leptons was 
detected in 1956 by Cowan and Reines. In 1962 it was shown by Lederman et al. that 
there is a substantial difference between the electron and the muon neutrino. Today the 
existence of three types of neutrinos, corresponding to the charged leptons, is established. 
In Table 1 1.1 1 the basic properties of the three charged and the three uncharged leptons are 
shown. Here the electric charge is given in units of the electron charge e. 


Table 1.1: The leptons and their basic properties 



mass in MeV/c 2 

electric charge 

electron 

0.510998928 ± 0.000000011 

-1 

electron-neutrino 

< 2 • 10“ 6 

0 

muon 

105.6583715 ±0.0000035 

-1 

muon-neutrino 

< 2•10“ 6 

0 

tauon 

1776.82 ±0.16 

-1 

tauon-neutrino 

< 2 • 10“ 6 

0 
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As can be seen in Table 0 the leptons can be arranged in three families, each family 
consisting of a charged particle and the corresponding neutrino. 

While the masses of the charged leptons are well known it has only been possible to esti¬ 
mate upper limits for the neutrino masses. Up to now only the mass differences between 
the different neutrino types have been measured with high precision. But neither the mass 
hierarchy of the neutrinos nor an absolute mass value for one of the neutrinos has been 
measured. 

1.1.1.2 The Quarks 

The abundance of discovered hadrons, particles bound by the strong interaction, required 
a new ordering scheme. This scheme was introduced by M. Gell-Mann, who arranged the 
known hadrons into several multiplets. Later on this classification could be explained by a 
new group of fundamental particles which are the building blocks of the known hadrons. 
This new group of particles was coined quarks. 

Today we know six quarks of different flavor (up, down, charm, strange, top and bot¬ 
tom), arranged in three families, analogous to the three lepton families. The three quark- 
families, together with their basic properties, can be seen in Table 1 1 .2} 


Table 1.2: The quarks and their basic properties 



current mass in MeV/c 2 

electric charge 

up 

down 

9 q+0.7 
z -°-0.5 

4 8 +0 ' 7 

+ 2 

~ 3 

1 

3 

charm 

1275 ± 25 

+ - 
~ 3 

strange 

95 ±5 

1 

3 

top 

173500 ± 600 ± 800 

+ - 
~ 3 

bottom 

4180 ± 30 

1 

3 


As it can be seen from the table, quarks exist with two different charges, +| and — 
Further their masses range from « IMeV/c 2 to 174GeV/c 2 , a multitude of the proton 
mass. 

1.1.2 The Fundamental Forces 

These two groups of fundamental particles interact with each other via four fundamental 
forces, listed in Table 1 1,3] in the order of their relative strength. 

Table 1.3: The four fundamental forces, listed in the order of their relative strength, at 

Q 2 = 0 


force 

strength 

theory 

gauge-bosons 

strong 

1 • 10 

QCD 

8 gluons g 

electromagnetic 

o 

1 

to 

QED 

photon 7 

weak 

o 

1 

co 

QFD 

W and Z° 

gravitational 

O 

1 

to 

GTR 

graviton 
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In the context of the SM only the first three interactions can be described by a quantum 
field theory (QFT). Gravity can only be described in terms of the general theory of rela¬ 
tivity which cannot be combined with the QFT of the SM. Hence, gravity is no part of the 
SM of Particle Physics. 

1.1.2.1 The electromagnetic interaction 

Quantum electrodynamics, which describes the electromagnetic interaction, is the oldest 
and most successful theory of the dynamic theories. It describes the interaction between 
charged particles by the exchange of a massless, electrically neutral boson (particle with 
integer spin): the photon 7 . In the limit of strong fields QED passes into classical electro¬ 
dynamics, described by Maxwell’s equations. 

Up to now QED predictions meet experiments with an extremely high degree of accuracy: 
currently about 1 CT 12 [fTTIl . 


1.1.2.2 The weak interaction 


In contrast to the other interactions described in the Standard Model the weak interaction 
distinguishes between left and right. It only affects left handed particles (particles with a 
spin antiparallel to their momentum) and right handed anti-particles, which means that it 
violates parity symmetry (P). Moreover the weak interaction violates CP-symmetry, i.e. 
particles and anti-particles behave differently under the weak interaction. This is a con¬ 
tribution to the dominance of matter in the universe. It is also the only interaction that is 
able to change the flavor of a particle, i.e. it can change an up-type quark into a down-type 
quark and vice versa. 

The weak interaction acts on all quarks and leptons, and is described by the exchange 
of heavy bosons, the W ± and Z° which have a mass of (80.385 ± 0.025) GeV/c 2 and 
(91.1876 ± 0.0021) GeV/c 2 0.71, respectively. The large mass of the force carriers leads to 
a range smaller than the diameter of a nucleus, in contrast to the electromagnetic interac¬ 
tion which has an infinite range. 

While the weak interaction conserves the family in the lepton sector, i.e. the number of 
particles from a specific family is conserved, this does not hold true for weak processes 
in the quark sector. In order to explain this discrepancy Cabibbo suggested 1963 that the 
quark generations are “rotated” for the weak interaction, consequently the weak eigen¬ 
states are different from the strong eigenstates. The eigenstates of the weak interaction 
are 


u 

d' 


” 7 


c \ I t 

’ b' 


(F2) 


in contrast to the strong eigenstates 




(1.3) 



1.1. The Standard Model 


5 


Here d', s' and b' are linear combinations of the physical quarks d, s and b. The relation be¬ 
tween the “twisted” and the physical quarks is given by the Cabibbo-Kobayashi-Mciskawa 
matrix (CKM matrix) V C km- 


( d'\ 


( Vud, Vus Vub ^ 


(d\ 

s' 

= 

Vcd Vcs ^cfe 


S 

W ) 


y Vtd. Vts Vtb ) 


{ b / 


At this point V a b names the relative coupling strength between a and b. Experiments have 
delivered the following magnitudes of all nine CKM matrix elements [171 : 



( 

0.97427 ±0.00015 

0.22534 ± 0.00065 

0 0 0 3 51 + 0 00015 
u.uuooi_ 0 00014 

\ 

\Vckm\ — 


0.22520 ± 0.00065 

0.97344 ± 0.00016 

0 0412 +aoon 
u.u / ±iz_ 0 0005 



V 

0.00867±" 

(1 04 04 +0 0011 

U.U^U^_ 0 0005 

(1 QQQ14fi + a000021 

u.yyy±^u_ 0i00004 6 

/ 


(1.5) 


As it can be seen from eq. ( |1.5[ ) transitions inside one family (e.g. c —>■ s) arc much 
more likely than transitions between different families (e.g. c —* d) which are called 
“Cabibbo-suppressed”. 


1.1.2.3 The strong interaction 

The strong interaction is responsible for the coupling of the quarks in bound states. While 
the electromagnetic interaction couples to the electric charge, and the weak interaction 
couples to the weak charge, the strong interaction couples to the color-charge of the 
quarks. Quarks exist in three colors (red, green and blue) and anti-quarks in three anti¬ 
colors (anti-red, anti-green and anti-blue). But it is only possible to observe the colorless 
bound states. In mesons the color of the quark and the anti-color of the anti-quark com¬ 
pensate (e.g. red and anti-red), while in baryons all three colors have to appear which 
leads to a colorless particle. 

The strong interaction is mediated by eight gluons, massless particles carrying a color- 
and an anti-color charge. In contrast to the electromagnetic and weak force, the strong 
force can not be described in terms of a simple l/r law (r is the distance between the 
interacting particles), since the strong coupling constant is a running constant, as shown 
in Figure 11.1 [ At high energies the quarks are close together and the interaction is weak 
(asymptotic freedom), while at low energies the distance between them is large and the 
interaction is strong (confinement). 
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1. Introduction 



Figure 1.1: Summary of the values of a s (/x) at the energy scale of // where they are 
measured. The lines show the central values and the ±lcr limits of the average (T). 


Asymptotic freedom can be explained by the self-interacting nature of the gluons. This 
self-interaction leads to a weak interaction between the quarks at small distances, and 
hence to asymptotic freedom. To understand confinement the strong interaction can be 
interpreted as a string, if two quarks are separated the energy density stored in the con¬ 
necting string rises. At sufficient high densities the string breaks up forming a quark- 
antiquark pair, which are bound to the primary quarks. Consequently quarks can not be 
referred to as free particles at low energies. 

1.1.3 Bound States 

As described before the strong interaction binds the quarks into colorless states, the 
hadrons. Today two colorless bound states of quarks are well established: the mesons 
and the baryons. This two groups will be explained in more detail in the next two sec¬ 
tions. 

Recent measurements by Belle ffT2l and LHCb [1T3TI seem to point to a new group of bound 
states, so called tetraquarks. But, since the nature of the observed tetraquark candidate 
Z (4430) is not confirmed, this group of hadrons will be omitted. 

1.1.3.1 Mesons 

Mesons, consisting of a quark and an antiquark, pose the most simple combination of 
quarks into a colorless bound state. Meson ground states can be divided into pseudoscalar 
and vector mesons, depending on the spin orientation of the quark and antiquark. If the 
spins are aligned antiparallel "fl the meson is called a pseudoscalar meson, while a parallel 
alignment is called a vector meson. Similar to atomic spectroscopy, excited states of 
mesons are possible by different values of the angular momentum L. 

However, the picture, that a meson only consists of a quark and an antiquark, is too 
simple. Since the quarks are bound by gluons which are self-interacting and can fluctuate 
into quark-antiquark pairs the inner structure of a meson is much more complicated. The 
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current model of the structure of a meson is that it consists of the valence quarks, the sea 
quarks and the sea gluons. The meson type is determined by the valence quarks alone. 
The basic properties of the mesons relevant for this work are shown in Table |1.4[ Here 
the width is an equivalent value to the lifetime for short-lived particles. 


1.1.3.2 Baryons 

Baryons, as second type of a bound state of quarks, consist of 3 valence quarks. The 
naming scheme relates to the isospin, as well as the quark content. For example baryons 
containing only two u or d quarks is called a A (isospin 0) or E (isospin 1). If the third 
quark is a charm or bottom quark it is given as index. For different mass states of the same 
quark content and isospin configuration the mass is given, e.g. ± c (2520). Consisting of 
three valence quarks they follow the Fermi statistics. Well known baryons are the proton 
and the neutron as constituents of the nucleus. Table |1.5| shows the basic properties of the 
baryons relevant for this work. 

Table 1.4: Mesons relevant for this work and their basic properties 


Meson 

quarks 

mass in MeV/c 2 

lifetime 

width T in MeV 

7T + 

ud 

139.57018 ± 0.00035 

(2.6033 ± 0.0005) • 10" 8 

— 

K+ 

us 

493.677 ±0.016 

(1.2385 ± 0.0024) • 10" 8 

— 

B° 

db 

5279.4 ±0.5 

(1.530 ±0.009) • FT 12 

— 

T{AS) 

bb 

10579.4 ± 1.2 

— 

20.5 ±2.5 


Table 1.5: Baryons relevant for this work and their basic properties 


B ary on 

quarks 

mass in MeV/c 2 

lifetime 

width T in MeV 

p,p 

uud 

938.27203 ± 0.00008 

> 10 31 years 

— 

At 

cud 

2286.46 ±0.14 

(200 ±6) • 10 _15 s 

— 

E c ( 2455)++ 

cuu 

2453.98 ±0.16 

— 

2.26 ±0.25 

± c (2455)+ 

cud 

2452.9 ± 0.4 

— 

< 4.6 

±7(2455)° 

cdd 

2453.74 ±0.16 

— 

2.16 ±0.26 
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2. Baryonic B decays 


Baryons are the main constituent of the visible matter in our universe, but despite their 
large importance for our understanding of the universe little is known about their produc¬ 
tion mechanisms. A possibility is the production in decays of heavy mesons. Here, B 
mesons are the first known mesons heavy enough to decay into a large variety of bary¬ 
onic final states. In addition, decays into baryons make up a significant part of the overall 
branching fraction of B mesons. 

In the next sections a summary of the most striking results is given. 


2.1 Multiplicity hierarchy 


5-decays to baryons are not as well studied as mesonic decays. Previous measurements 
show a strong hierarchy of the branching fractions depending on the final state multiplic¬ 


ity, as shown in Table 2.1 for B decays to a /1+ baryon and in Table 2.2 for decays to a 


charmed meson, accompanied by a baryon-antibaryon pair. For B decays to a charmed 
baryon the branching fraction increases by an order of magnitude comparing the two- 
body decay B° —> A+p with the three-body decays 5° —>• A+pir° and B~ —> A+pn~. 
This increase is in contrast to the mesonic B decays, where the three-body branching 
fraction is at the same order of magnitude as for the corresponding two-body mode. A 
possible explanation for the strong increase from the two-body to the three-body decay 


Table 2.1: Branching fraction results for B —>■ A+p m ■ n with m = 0,1, 2, 3, ordered 
according to their multiplicity [T7T1. 


decay mode 

B±cr( 10” 4 ) 

B° Atp 

0.20 ±0.04 

B° —> A^pTT 0 

1.9 ±0.5 

B~ —>■ A+pir~ 

2.8 ±0.8 

B° —> A^pTl~n + 

11.2 ±3.2 

B~ —>■ A^ p7T~ 7l~ 7T + 

22 ±7 
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2. Baryonic B decays 


comes from the resonant substructure. The additional pion allows for E c and nucleon 
resonances, thus increasing the number of possible decay paths. Further, according to 
Ifl4l the production of a two-body baryonic final state requires a hard gluon, introducing 
a strong suppression factor. Adding a light meson reduces the invariant mass of the re¬ 
maining system, allowing for a soft gluon in the baryon production. For the decay into a 
charmed meson accompanied by a baryon-antibaryon pair the branching fraction reaches 
its maximum for multiplicities of four, as can be seen in Table |2.2| 

Table 2.2: Branching fraction results for B —>■ Dpp m • n with m = 0,1,2 [6], ordered 
according to their multiplicity. 


B decay 

B ± CTrtat ± CX syst (10 4 ) 

B° — > D°pp 

1.02 ±0.04 ±0.06 

B° — > D*°pp 

0.97 ±0.07 ±0.09 

B° —> D + ppn~ 

3.32 ±0.10 ±0.29 

B° —> D* + ppTr~ 

4.55 ±0.16 ±0.39 

B~ —> D°pp7r~ 

3.72 ±0.11 ±0.25 

B~ — > D*°ppiT~ 

3.73 ±0.17 ±0.27 

B° —> D°ppTT~TT + 

2.99 ±0.21 ±0.45 

B° —> D*°pp7r~7r + 

1.91 ±0.36 ±0.29 

B~ —>■ D + ppiT~n~ 

1.66 ±0.13 ±0.27 

B~ —>- D* + ppTT~7l~ 

1.86 ±0.16 ±0.19 


2.2 Threshold enhancement 


A feature of baryonic B decays observed quite frequently is an enhancement at low invari¬ 
ant baryon-antibaryon masses. Examples are shown in Fig. 2.1 Common to all the given 
examples is a deviation from a simple phasespace model at the invariant-mass threshold. 


Explanations for this feature come from different sides. For decays of the type B —y 
K ± X, with the fundamental subprocess b —>■ s + g + g a strong contribution from a 
flavor-singlet penguin is expected. In terms of baryonic B decays like B + —* ppK + a 
dominant contribution of a pp bound state with J pc = 0 ±+ can provide a fair fraction of 
the observed final state |[T5l . 


For decays not dominated by flavor-singlet penguins a possible explanation for the thresh¬ 
old enhancement comes from the fragmentation into hadrons. If the baryons are neighbors 
in the fragmentation chain one expects their invariant mass to be low. A pole-model re¬ 
lated rule of thumb, given in [IT6TI . is to check if the decay could proceed via an initial 
meson-meson or baryon-antibaryon configuration. In the first case one of the mesons 
decays into a baryon antibaryon pair, giving rise to the low mass enhancement. In the 
latter case no such enhancement is expected. This model is quite successful in explain¬ 
ing the results in B° —> A p pTr~iT + . In this analysis [ 16!] an enhancement is seen for 
the resonant subdecay B° — > 27 c (2455) ++ p 7r_ > while no enhancement is visible for 
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m(A L ! p) [GeV/c 2 ] 


(c) 



o 

LO 



CO 



(d) 


Figure 2.1: The invariant baryon antibaryon mass distributio ns f or (a) B —> A+pn 
dU, |(b)| B° Apn + []3l, |(c)j B° -)■ A+pK~n + 0 0 and |(dj| -)■ D+ppii- (g). 


For the upper row the signal distribution was divided by the expectation from a simple 
phase space model. The lower row shows the signal distribution, with the phase space 
expectation represented as a histogram for B° —> I) + ppn ~. 
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2. Baryonic B decays 



Figure 2.2: Comparison of phase space distributed signal Monte Carlo (histogram) with 
^Plotted data (data points), |(a)]shows the invariant E c (2A55) ++ p mass, and |(b)| the invari¬ 
ant 27 c (2455 )°p mass. 


B° —> I7 c (2455) 0 p7r + . A comparison of phase space simulation and experimental data 


is shown in Fig. 2.2 Comparing the two distributions evidence for a low mass en¬ 
hancement in m(Z , c (2455) ++ p) is visible, while the low mass region in m(27 c (2455) 0 p) 
is unpopulated. In lfl6l a simple model to explain the absence of an enhancement in 
m(27 c (2455)°p) is suggested. In this model the Feynman diagrams are categorized into 
two classes, according to their quark configuration after the weak decay. 


• meson-meson configuration, i.e., before quark fragmentation, the quarks are ar¬ 
ranged in two (virtual) mesons. 

• diquark-diquark configuration, i.e., before quark fragmentation, the quarks are 
arranged in a diquark (qiq 2 ) and an anti-diquark state. 


In the meson-meson configuration one of the mesons fragments into a baryon anti-baryon 
pair, while the second meson carries away momentum, leading to a threshold enhance¬ 
ment. In the diquark-diquark configuration such an enhancement is not possible, since 
we already start from a baryon anti-baryon configuration. Comparing the Feynman graphs 
shown in Fig. 2.3 and 2.4| only the external graph in Fig. 2.4(c) is in the meson-meson 
configuration, which explains the absence of an enhancement for B° —> 27 c (2455)°p7r + . 


This approach seems to be valid for other B meson decays into baryons as well, and is 
equivalent to the explanation via the fragmentation mechanism given in [U5ll . 




(a) 


(b) 


(c) 


Figure 2.3: Comparison of the Feynman graphs responsible for the decay B° 

E c (2455)°pn + . 
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(a) 


(b) 


(c) 


Figure 2.4: Comparison of the Feynman graphs responsible for the decay B° —> 

S c (2455) ++ pn~. 


2.3 Semileptonic B decays into baryons 


Starting from the Feynman graphs shown in |2.3| and |2.4| we can assume that the thresh¬ 
old enhancement is caused by the external Feynman graph. In order to assess the rel¬ 
ative influence of the external graph we have to investigate a decay, that can proceed 
only via an external graph. The ideal object for such a study is the semileptonic decay 
B~ —y A+pl~Vp, with t~ = (e~, p~) and up = (v e ,u^) (Charge conjugation is implied 
throughout this work). The corresponding Feynman graph is shown in Fig. 23} 



Figure 2.5: Feynman graph for the decay B —» A+pi vi- 


Up to now, there are only upper limits for its relative strength available. The CLEO 
collaboration showed that the ratio of B —> A^Xe~V e to B —$■ A+X is smaller than 
5% at 90% C.L. llTTll . But this result has two caveats. First, the lepton momentum is 
required to be greater than 0.6 GeV/c, which reduces background from fake and secondary 
electrons, but the signal efficiency as well. Second, uncorrelated B —> A~X events pose 
a large source of systematic uncertainty. To circumvent these caveats BaBAR uses a tagged 
approach, by reconstructing a B meson in a hadronic mode and looking for the signal in 
its recoil [fl8ll . With this approach BaBAR determines the before mentioned ratio to be 


B(B -A A+X£~ v t ) 
B(B -A A+X) 


< 3.5% 


( 2 . 1 ) 


at the 90% confidence level. 

The first direct measurement of a semileptonic baryonic B decay is a recent publication 
by the Belle collaboration Ifl9l . Like the previously mentioned BaBAR measurement they 
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2. Baryonic B decays 


use a tagged approach, and found evidence for the decay B~ —> ppt~V(_. For this mea¬ 
surement Belle studied 615 exclusive hadronic decays of charged B mesons. Paired with 
the large dataset of 772 million BB pairs they measured a branching fraction of 

B(B~ —tpp£~l7 ( ) = (5.8l2n(stat) ± 0.9(syst)) x 10~ 6 (2.2) 

with a significance of 3.2cr. The corresponding upper limit at 90%C.L. is 

B(B~ ->■ ppru e ) < 9.6 x 10“ 4 . (2.3) 

Starting from the measurement of B — > pp£~T> f a rough estimate of the branching frac¬ 
tion for B~ —> can be obtained. Neglecting phase space differences, the only 

difference is the CKM matrix element in the b quark decay. For the former \V u b\ has to 
be considered, while the latter depends on \V cb \. The ratio \V cb /V ub \ equals 9.9 0, and 
hence we can expect 


B{B~ ->• A+pTVi) « (5.7 ± 2.9) x 1(T 4 . (2.4) 


Another approach to obtain a branching fraction estimate is the fully hadronic decay 
B~ — * A+pn- which is measured to be (2.8 ± 0.8) x 10” 4 Q. Neglecting any influ¬ 
ence of internal W emission diagrams we can obtain a branching fraction by assuming 

B(B~ -» A+pt-Vi) _ B(r~ -» 

B(B~ —y A+pn~) B(t~ —> ti~u t ) 

B(t~ —y tt~u t ) is measured to (10.83 ± 0.06)%, B{t~ —> e~V e u T ) to (17.83 ± 0.04)%, 
B(t~ —> to (17.41 ± 0.04)% Q. Neglecting the small difference between the 

latter two the ratio is roughly 1.8, which leads to 

B[B~ ->• A+prv t ) w (4.7 ± 1.3) x 10~ 4 . (2.6) 


An estimate for the semileptonic branching fraction could be obtained in the isospin 
analysis of B° — > A+pir 0 and B° —> A+pq [20). Based on isospin relations the au¬ 
thor predicts the relative strength of the external Feynman graph in B~ —>■ Aypn' of 
(0.20 ± 0.45) x 10” 4 . Combined with the ratio of r decays this leads to a 90% confidence 
level upper limit for the semileptonic decay of 

B(B° -)• A+pe~ V e ) < 1.4 x 10“ 4 . (2.7) 


This result shows a slight tension with the predictions given in eq. ( |2.4[ ) and ( |2.6[ ). But 
given the limitations of these predictions the isospin prediction might be the most reliable 


one. 









3. The BABAR experiment 


The BABAR experiment, operated from 1999 to 2008, was designed and built to perform 
a systematic study of CP-asymmetries in the decays of neutral /i-mcsons. Furthermore, 
BABAR allowed a sensitive measurement of the CKM matrix element 114ft I an d obser¬ 
vations of rare D, B and r-decays. Together these results are capable of putting con¬ 
straints on fundamental parameters of the Standard Model. In addition a wide spectrum 
of physics topics, like baryonic B -decays or charm- and tau-physics, could be studied 
with the BABAR detector. For the investigation of /i-mesons BaBAR was operated on the 
energy of the T(4S') (10.58 GeV) resonance, which decays into a BB pair with a proba¬ 
bility of more than 96%f7]|. The data set collected on this resonance is the starting point 
for the presented analysis. 

The experimental setup, shown in Fig. [37T is described in detail in the next sections, 
following the description given in lUOll . 



Figure 3.1: A schematic view of the SLAC site with the e + e accelerator and the BaBAR 
detector 
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3. The BaBAR experiment 


3.1 The e + e accelerator 


Electrons and positrons were accelerated in the 3.2 km long Linac up to energies of sev¬ 
eral GeV and injected into the PEP-II storage rings, where they were brought to collision 
inside the BaBAR detector. 

Inside PEP-II, electrons were stored in the high energy ring (HER) with an energy of 
9 GeV, while the positrons were stored in the low energy ring (LER) with an energy of 
approx. 3.1 GeV. These energies meet the requirement of a center-of-mass energy y/s 
equal to the mass of the T(4S) resonance, which decays into BB pairs, half of the time 
into a B°B°- and the other half into a B + B~- pair. 

The cross-sections for the different reactions possible in the e + e~ collision at 

Bhabha- scattering (V 


10.58 GeV can be seen in table 3.1 


—y e^e" 


s = 

) has by far the 


highest cross-section, while the other reactions are near 1 nb. Consequently, in most of 
the collisions, no 65-pair is produced. 

The unique chance BaBAR offered were the asymmetric beam energies, leading to a boost 
of the center-of-mass system into the direction of the electron beam. The boost is crucial 
for studies of CP asymmetries, by allowing a measurement of the difference in the decay 
times of the two B mesons. Therefore, the decay vertices of the 5-mesons have to be 
measured. The decay time difference can now be estimated by measuring the distance 
between the two vertices. 


Table 3.1: Production cross-sections at y/s = 10.58 GeV. The e + e cross-section is the 
effective cross-section within the experimental acceptance (8). 


e + e —> 

cross-section (nb) 

bb 

1.05 

cc 

1.30 

ss 

0.35 

uu 

1.39 

dd 

0.35 

A 

+ 

0.94 

p+ll 

1.16 

e + e^ 

« 40 


3.2 The BABAR detector 

The foremost requirement on the detector was a maximal acceptance in the center-of-mass 
system. Due to the asymmetric beam energies the decay products were boosted in the 
forward direction of the laboratory frame. Thus, an asymmetric detector was necessary 
to optimize the detector acceptance. Furthermore, an excellent vertex resolution, as well 
as a good discrimination between e, /i, n, K, and p over a wide kinematic range and the 
capability to detect and identify neutral particles were necessary. 

BaBAR has been designed to meet all of these requirements. Picture |T2| shows a schematic 
view of the BaBAR detector. The detector was arranged cylindrically around the beam pipe 
and consisted from the center outwards of the following subsystems: 
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• The Silicon Vertex Tracker (SVT), was providing precise position information on 
charged tracks, is the only tracking device for very low-energy charged particles, 

• the Drift Chamber (DCH) provided the main momentum measurement for charged 
particles, and helped in particle identification by measuring the energy loss of 
traversing particles, 

• the Detector of Internally Reflected Cherenkov light (DIRC), responsible for the 
identification of charged hadrons, 

• the Electromagnetic Calorimeter (EMC), was providing information about neutral 
particles as well as a good electron identification, 

• the superconducting coil, was providing a 1.5T solenoidal magnetic field for the 
momentum measurement in the DCH, and 

• the Instrumented Flux Return (IFR) was used for muon and neutral hadron identifi¬ 
cation. 

The different parts will be explained in the following section. A more detailed description 
of the detector can be found in [HI, [ITHl and ll22l . 
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3. The BaBAR experiment 



(a) 


Scale 


BABAR Coordinate System 


4m 


Detector 
IP. 



Instrumented 
Flux Return (IFR)) 
Barrel 



Superconducting 

Coil 


Cryogenic 

Chimney 



(b) 

Figure 3.2: The BaBAR detector: In the upper right comer the BaBAR coordinate system is 
shown, z is the direction of the electron beam, y points upwards while x points horizon¬ 
tally away from the center of the PEP-II ring, [(a)] shows the detector end view (looking 
into the direction of the HER beam), while [(b)] shows a longitudinal cut of the detector. 
































































































































































































































































3.2. The BaBAR detector 


19 


3.2.1 Silicon Vertex Tracker 


The Silicon Vertex Tracker (SVT) was the detector component with the smallest distance 
to the interaction point (IP). Consequently, it was the first component providing infor¬ 
mation on the flight path (track) of the particles emerging from the interaction point. 
Furthermore, it was the only source of information for low momentum particles that did 
not reach the drift chamber due to their deflection caused by the magnetic field. 

The SVT, shown in Fig. 3.3[ was located inside a 4.5 m long support tube. It consisted 
of five concentric cylindrical layers of double-sided silicon detectors. These five rings 
had radii of 3.3 cm up to 14.6 cm. If a charged particle crossed the SVT it generated 
electron-hole pairs leading to a signal. Due to the segmentation of the SVT, the signals 
in the different segments provided information about the path of the particle. Besides, 
the SVT also provided information for the particle identification of charged particles with 
momenta less than 700 MeV/c by measuring the rate of energy loss dE/dx [|23ll . Figure 
|3.4| shows the energy loss per travelled path against the momentum of the particle for dif¬ 
ferent particles in the SVT. 

The three rings closest to the IP delivered data about the position and the angle of a track 
for the track reconstruction with a spatial resolution of 10 — 15/im ||24l . The two outer 
rings had a resolution of 30 — 40 /im ll24l and were important for the measurement of the 
momentum of particles with a small transversal momentum as well as for the separation 
of geometrically close tracks. 



Figure 3.3: Longitudinal view of the SVT 
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3. The BaBAR experiment 



Figure 3.4: Energy loss per travelled path versus momentum in the SVT for different 
particles, e - electron, /i - muon, K - kaon, p - proton, d - deuteron 


3.2.2 Drift Chamber 


The drift chamber (DCH) was the main tracking device of the BaBAR detector. It measured 
at least 40 space coordinates per track in the central region, ensuring a high reconstruction 
efficiency for tracks with transverse momentum greater than 100 MeV/c. Further, the drift 
chamber contributed to the particle identification. Therefore, the rate of energy loss was 
measured, this rate is characteristical for the different types of particles. Figure [33] shows 
the rate of energy loss versus momentum. 

The drift chamber was a 280 cm long cylinder, with an inner radius of 23.6 cm and an 
outer radius of 80.9 cm. It consisted of 7104 hexagonal cells, which were formed by 6 
field wires in the comers of the hexagon and a sense wire in the center of the cell. The 
cells were arranged in 10 superlayers of 4 layers each. Axial (A) and stereo (U,V) su¬ 
perlayers alternate as it can be seen in Fig. [376 This led to a z—coordinate resolution of 
700 /jm [1251 . 

If a particle crossed the drift chamber it ionized the gas mixture (80% Helium, 20% Isobu¬ 
tane) inside the chamber. The generated ions and electrons drifted towards the field wires 
or to the sense wires, respectively, due to a difference in the voltage of 1960V between the 
sense and field wires. The generated signal could be used for tracking as well as for par¬ 
ticle identification by the measurement of dE/dx. Figure [3/7] shows the spatial resolution 
of the drift cells. 
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dE/dx vs momentum 



Track momentum (GeV/c) 

Figure 3.5: energy loss per travelled path versus momentum in the DCH for different 
particles (e - electron, /j - muon, K - kaon, p - proton, d - deuteron) 
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Figure 3.6: Cell layout in the BaBAR drift chamber 
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Figure 3.7: Spatial resolution of the drift cells 


3.2.3 Superconducting Solenoid 

Although the superconducting solenoid, together with the instrumented flux return, was 
the outermost detector component its function was closely related to the drift chamber. 
The solenoid, a superconducting coil, created a magnetic field of 1.5 Tesla inside the drift 
chamber leading to a deflection of the track of a charged particle. This deflection was used 
to measure the charge as well as the momentum of a charged particle with a transverse 
momentum resolution of o{pt)/pt ~ 0.45% + 0.13% • pr ll25ll . 


3.2.4 Detector for internally reflected Cherenkov light 

The Detector for internally reflected Cherenkov light (DIRC) was capable of identifying 
pions and kaons with momenta greater than 0.7 GeV/c as well as protons with momenta 
between 1.3 GeV/c and 4 GeV/c. This was complementary to the drift chamber, which 
could identify particles with lower momenta. 
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Figure 3.8: Fig. (a) shows a schematic view of the DIRC, the bar box containing the 
quartz bars and the standoff box are clearly visible. Fig. |(b)| illustrates the mode of op¬ 
eration of the DIRC. The emitted Cherenkov light is guided by total internal reflection to 
the standoff box where the Cherenkov angle is measured with the help of photomultiplier 
tubes. 


Fig. |3.8| shows a schematic view of the DIRC. It consisted of 144 bars of synthetic quartz 
with a refractive index of n ~ 1.474 surrounding the drift chamber. If a particle crossed 
these bars with a velocity larger than the speed of light inside the quartz it produced 
Cherenkov light. The Cherenkov photons were emitted at an angle 9 C relatively to the 
direction of the particle. This angle depends on the mass and momentum of the particle. 


cos 6 C = — = 

pn 


!+(? 


n 


(3.1) 


Inside the bars the Cherenkov photons were reflected many times until they entered the 
standoff box at the rear side of the detector. This box was filled with about 6000 liters of 
purified water and was equipped with nearly 11000 photo multiplier tubes to detect the 
Cherenkov angle. Fig. [379 shows the discrimination power of the Cherenkov angle. The 
angular resolution of the DIRC for a single photon was 7 mrad 
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Figure 3.9: Cherenkov angle against momentum for di 
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3.2.5 Electromagnetic Calorimeter 


The BaBAR electromagnetic calorimeter (EMC), as shown in Fig. |3. 10 was a hollow 
cylinder surrounding the drift chamber. The calorimeter barrel consisted of 5760 thallium- 
doped Csl crystals arranged in 48 rings of 120 crystals, while the End Cap was composed 
of 820 crystals. 




Figure 3.10: schematic view of the EMC, showing the calorimeter barrel and the forward 
End Cap. 


The calorimeter was designed for excellent efficiency as well as good energy ( cr E /E ) and 
angular resolution of the energy range 20 MeV to 9 GeV. A detailed study [[271 gave 


~E 


(2.30 ±0.03 ±0.3)% 
E( GeV) 


(1.35 ±0.08 ±0.2)% 


&6 ~ C<f> 


(4.16 ± 0.04) mrad 
\/E( GeV) 


(3.2) 

(3.3) 


Here, the © denotes a quadratic summation. Due to the asymmetric design the calorime¬ 
ter covered a solid angle of —0.775 < cos 9 < 0.962 in the laboratory frame and 
—0.916 < cos 0 < 0.895 in the center-of-mass frame [J8). 

High energetic photons entering a crystal were converted into e + e pairs by interacting 
with the crystal. The created electrons and positrons emitted bremsstrahlung photons 
which could convert into e + e _ pairs again. An electromagnetic shower developed. The 
generated photons could be absorbed by the scintillator material leading to an excitation 
of the crystal atoms. When the atom returned into its groundstate it emitted the absorbed 
energy as light. This light could be detected by photo diodes glued to the rear end of the 
crystals. Here the amount of measured light was proportional to the energy lost inside the 
crystals by a particle crossing the EMC. 
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In addition to the detection of photons the calorimeter was an important detector compo¬ 
nent for the discrimination between electrons and other particles. Details on the electron 


identification with the help of the EMC are given in section 3.3.3.1 


3.2.6 Instrumented Flux Return 

The outermost layer of the BaBAR detector was the instrumented flux return (IFR). Its 
main purpose as a detector component was the identification of muons as well as neutral 
hadrons like the K\ mesons. Consisting of a central part (Barrel) and two End doors the 
IFR covered a solid angle range down to 300 mrad in the forward, and 400 mrad in the 
backward direction [0. In the initial setup the IFR was instrumented with 806 Resistive 
Plate Chambers (RPCs), arranged in 19 layers in the barrel and 18 layers in the End Caps. 
Already in the first year of data taking the RPCs showed serious aging problems and had 
to be replaced by second generation RPCs and Fimited Streamer Tubes (FSTs). The final 
configuration of the IFR consisted of 12 layers of FSTs in the barrel and 16 layers of 
second generation RPCs in the forward End Cap. In the backward End Cap the original 
RPCs were retained. During the upgrade to FSTs the remaining flux return slots were 
filled with brass absorber plates, and some external steel plates were added. Thereby the 


pion rejection ability of the muon identification algorithm, described in section 3.3.3.2 
was improved lf28l . A sketch of the final IFR configuration can be seen in Fig. |3.11 



Figure 3.11: Final layout of the IFR with barrel sextants, forward (FW) and backward 
(BW) end caps, the dimensions are given in mm. 


3.2.7 Trigger 

The BaBAR trigger consisted of two levels: a hardware level (called Fevel 1) and a soft¬ 
ware level (called Fevel 3) and had to decide which events observed by the BaBAR detector 
were interesting enough to be kept and recorded for later analysis. 

The Fevel 1 trigger system consisted of four subsystems: the charged particle trigger 
(DCH trigger - DCT), the neutral particle trigger (EMC trigger - EMT), the cosmic trig¬ 
ger (IFR trigger - IFT) and the Global Fevel Trigger (GET). 

The DCT and EMT received information from the Drift Chamber and the Calorimeter, 
respectively, processed it and sent the condensed information to the Global Trigger. This 
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trigger tried to match the angular locations of calorimeter towers and drift chamber tracks 
and generated Level 1 trigger signal, which were passed to the Level 3 trigger. In addition 
to the information from the DCT and EMT the Global Trigger also uses IFT information 
to trigger on cosmic rays and /i-pair events. 

After the Level 1 trigger the Level 3 trigger, operating on an online farm, analyzed the 
event data from the DCH and EMC in conjunction with the Level 1 trigger information for 
further background reduction. Besides the physics filter this trigger stage also performed 
Bhabha veto and the selection of calibration events as well as critical online monitoring 
tasks Il29l . 

3.3 Event reconstruction 

The events that have been selected to be kept were passed to the reconstruction software 
which reconstructed the tracks in an event and assigned particle hypotheses to the tracks. 
During the reconstruction neutral and charged particles had to be considered separately. 
Charged particles interacted with every detector component and left a track inside the 
detector that could be reconstructed. In contrast, neutral particles only interacted with the 
EMC and the IFR. Consequently, their reconstruction relied on information delivered by 
these two detector components like their shower shape in the EMC and the energy deposit 
in the detector. 

3.3.1 Neutrals 

All neutrals in the BaBAR framework are based on CalorNeutral, a standard list cre¬ 
ated during online production. This list contains all EMC clusters that are not assigned to 
any track in the event. For all these entries the photon mass-hypothesis is applied. 

Based on CalorNeutral BaBAR uses two photon lists, namely GoodPhotonLoose 
and GoodPhotonDefault 113011 . The first one contains all candidates from Calor¬ 
Neutral satisfying the following criteria: 

• Minimum raw energy of 0.03 GeV 

• Maximum Lateral Moment LAT of 0.8. 

The GoodPhotonDef ault list contains all candidates from GoodPhotonLoose with 
a minimum raw energy of 0.1 GeV. 

3.3.2 Track reconstruction 

For charged particles a multi-step track reconstruction takes place. The first step is to 
search for track points in both tracking devices (SVT and DCH) separately. Due to the 
multi-layer design of the SVT and DCH it is possible to reconstruct the path a particle 
took inside these detector components, therefore a curve is fitted to the track points in 
the respective subdetector. Afterwards the reconstruction program tries to assign partial 
tracks inside the SVT and in the DCH to each other. After a successful assignment the 
combined track is fitted again to provide a good momentum measurement. 

The least stringent requirement for an accepted track is a successful reconstruction of the 
track as well as a successful momentum measurement. A track fulfilling these conditions 
is assigned to the ChargedTracks list under a pion hypothesis. 

Table |3.2| shows the requirements for the tracking lists used at BaBAR , as they can be found 
in OT1 . The different selection criteria shown in table |3.2| are: 
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• transverse momentum p t [ GeV/c] 

• momentum p [ GeV/c] 

• distance of closest approach (DOCA) to the z-axis in the xy -plane d 0 [cm] 

• ^-coordinate of the point of closest approach z 0 [cm] 

• fit probability P(x 2 ) 

Table 3.2: Tracking lists used at BaBAR and the selection criteria for these lists 


tracking list 

Pt 

p 

Mol 

Mol 

p 

ChargedTracks 

- 

- 

- 

- 

> 0 

GoodTracksVeryLoose 

- 

- 

< 1.5 

< 2.5 

> 0 

GoodTracksLoose 

> 0.05 

< 10 

< 1.5 

< 2.5 

> 0 


For the latter neutrino reconstruction we have to rely on ChargedTracks since iden¬ 
tified Vo tracks, including K s , A and converted 7 are not included in the more stringent 
lists GoodTracksVeryLoose and GoodTracksLoose. 

3.3.3 Charged Particle Identification 

The identification of charged particles is crucial for many analyses performed at BaBAR. 
For a high quality of the identification all BaBAR subdetectors contributed in a comple¬ 
mentary way to charged particle identification. The SVT and DCH provides c\E/dx mea¬ 
surements, the DIRC provides a velocity measurement, the EMC discriminates electrons, 
muons and hadrons according to their energy deposit and their shower shape while the 
IFR characterizes muons and hadrons according to their different interaction pattern. 

3.3.3.1 Electron identification with the EMC 

For the electron identification with the calorimeter we have to distinguish between two 
methods. 

1. Electron identification using E/p 

2. Electron identification using the shower shape 

For the first method the ratio E/p is measured, where E is the energy of a shower in 
the calorimeter, and p is the measured three-momentum of the corresponding charged 
track. When an electron enters the calorimeter it produces an electromagnetic shower, 
depositing its energy in the calorimeter. Thus, the ratio E/p is expected to be close to 
unity for an electron. In contrast muons and charged hadrons deposit only a fraction of 
their energy in the calorimeter, leading to smaller values of E/p. In addition to the good 
separation of electrons this method can also be used for a discrimination of muons against 
hadrons. While muons, as single minimum-ionizing particles, have a well-defined peak 
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in the E/p distribution, hadrons have additional tails at higher E/p values. A caveat 
of the E/p method is that hadrons interact electromagnetically as well as hadronically 
with the calorimeter. In the latter case they initiate a hadronic shower, depositing a large 
fraction of their energy in the calorimeter. Since the resulting E jp values are rather 
large a discrimination against electrons by E/p alone is difficult. An improvement of the 
electron identification can be achieved by using the different shower shape for electrons 
and hadrons. While electrons deposit most of their energy in two or three crystals hadronic 
showers are more extended. A quantity reflecting this difference is the lateral moment 


LAT 



LAT 


(3.4) 


Here, the energies E, are ordered according to their value (E) > E 2 > ... > E N ), with 
N being the number of crystals associated with the shower. r t and o l are the polar coor¬ 
dinates of the crystal in the plane perpendicular to the line pointing from the interaction 
point to the center of the shower. r 0 is the average distance between two crystals (« 5 cm). 
Since the summation omits the two crystals with the highest energy deposit this quantity is 
expected to be small for electromagnetic showers. An even better discrimination between 
electrons and hadrons is achieved by including information about the azimuthal distribu¬ 
tion of the shower. In general electromagnetic showers are expected to be isotropic in </>, 
while hadronic showers are far more irregular in (f>. 

3.3.3.2 Muon identification with the IFR 

The first step for muon identification with the IFR is the track reconstruction. Due to 
the low occupancy this is an uncritical task, since only a tiny fraction of tracks overlap 
each other. First, tracks are reconstructed in each sector using a clustering algorithm. In 
the second step the clusters in different sectors are merged, using the extrapolation of 
the charged tracks measured by the tracking systems (SVT and DCH) into the IFR. Such 
composite clusters are considered as candidates for muons or charged hadrons. 

For the muon identification the algorithm has to decide, whether the detected track has 
been produced by a muon or a pion. The other hadron candidates, kaons and protons, are 
identified by the other particle identification sub-systems since they show a similar sig¬ 
nature like the pion in the IFR. For the discrimination of muons against charged hadrons 
different IFR variables are used, e.g. the number of strips in IFR cluster, the number of 
measured interaction lengths, and the continuity of IFR hits in the 3-D IFR cluster [|8ll32|. 

3.3.3.3 BDT selectors 

For muons we apply the BDT selector, which is based on a Bagged Decision Tree (also 
Bootstrap Aggregating Decision Tree). A decision tree splits an n dimensional input set 
(with M entries) into rectangular subsets (nodes), where for each split all variables are 
considered and the split that leads to the largest increase in the figure of merit (e.g. the 
muon efficiency) is selected. This split is repeated recursively for each of the resulting 
nodes. The recursion ends when the number of entries in a node hits a pre-defined min¬ 
imum m, or if the figure of merit doesn’t change significantly. If the majority of entries 
in this node are signal entries this node is classified as a signal node, otherwise it is a 
background node. To enhance the reliability and reduce the variance of this procedure 
many trees are trained on bootstraped replicas of the training set. The bootstraped replica 
is obtained by sampling with replacement from the original training set, until the size of 
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the replica equals the size of the original training set. For an improvement of the perfor¬ 
mance compared to a single tree the classifier used has to be sensitive to small changes 
in the training set. This is obtained by setting a small value for m, and thus overtraining 
the trees. While each tree has a poor predictive power the final vote, a majority vote of all 
trained trees, has a high predictive power lf33l . The major goal of the BDT selectors is to 
provide a constant muon efficiency over time, momentum p and polar angle 6. While the 
last two are easy to understand the constant efficiency over time incorporates the aging of 
the detector components for the muon identification, as well as upgrades of the IFR. The 
muon efficiencies for the different PID lists can be found in Tab. 13.31 Details on the BDT 
input variables and performance can be found in lf34l . 

Table 3.3: The target efficiencies for the muon BDT lists. While in the first four lists a 
constant muon efficiency was targeted, the following four lists aimed at a constant pion 
mis-ID. The last two lists are specially designed for muons with momenta in the range 0.3 
to 0.7 GeV/c. 


List name 

muon eff. 

pion mis-ID 

muBDTVeryLoose 

90.0% 

variable 

muBDTLoose 

80.0% 

variable 

muBDTTight 

70.0% 

variable 

muBDTVeryTight 

60.0% 

variable 

muBDTVeryLooseFakeRate 

variable 

5.0% 

muBDTLooseFakeRate 

variable 

3.0% 

muBDTTightFakeRate 

variable 

2 .0% 

muBDTVeryTightFakeRate 

variable 

1 .2% 

muBDTLoPLoose 

70.0% 

variable 

muBDTLoPTight 

60.0% 

variable 


3.3.3.4 KM selectors 


The KM lists are based on Error Correcting Output Code, that combines multiple binary 
classifiers to form a multiclass classifier. Here, the binary classifiers are trained differ¬ 
ently. In the case of the KM selectors seven Bootstrap Aggregate Decision Trees t, were 
trained according to Table 3.4 For the classification of a given track each classifier t, 
is asked to give an output between —1 and 1, resulting in a string of seven real values 
between —1 and 1. This string is then compared to the individual codeword for class k. 
For a kaon the codeword is the first row in Table |3.4[ for a 7r the second and so on. The 
comparison is here done by calculating the generalized Hamming distance H k , the sum 
of squared differences, to the codeword for each class k. The track is then assigned to the 
class with the lowest Hamming distance. In order to obtain multiple tightness levels we 
can use the distance ///. itself as well as ratios of distances. For a kaon list we can use IIk, 
/%/ II K , II ft /II K and H e /H K . For the other particle classes we use the analogue ratios. 
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Table 3.4: The used indicator matrix in the KM selector. Each entry indicates whether 
the given training sample should be treated as signal (1) or background (-1). 


Class 

to 

ti 

t‘2 

t'S 

U 

f-5 

to 

K 

1 

1 

1 

1 

1 

1 

1 

7T 

-1 

1 

-1 

1 

-1 

1 

-1 

P 

1 

-1 

-1 

1 

1 

-1 

-1 

e 

1 

1 

1 

-1 

-1 

-1 

-1 


This method allows us to control the probability for a given class as well as the misidenti- 
fication rate. More details on these selectors, especially on the chosen input variables for 
the seven classifiers can be found in If351 . 

For high flexibility special selectors for the pre-selection of recorded data, were intro¬ 
duced. These CombinedSuperLoose selectors combine the least restrictive selectors 
for a given particle type to allow the user to switch to his selector of choice in the event 
reconstruction. 
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4. Software and Datasets 

4.1 Software 

For the reconstruction of the decay we use the following packages inside the BaBAR frame¬ 
work 


• analysis-52 

• BetaMiniUser V00-04-05 

• PDT V00-07-00 

• workdir V00-04-21 


The event selection and background reduction uses the data analysis package ROOT ll36ll 
and iPython 071 with SciPy P8l . 


In the background reduction we apply a random forest, provided by SciPy, to distinguish 
between signal and background candidates. A random forest is a modification of the 
bagging method, as described in sect. |3.3.3.3[ where a large collection of trees is build, 
and the response is averaged over all trees. The benefit is that one can use models with a 
small predictive power, thus unbiased, for classification in this method P9l . For bagged 
trees the bias is the same as for the individual tree. In consequence the only improvement 
can be obtained in terms of the variance of the average 


per 2 + 


B 


P 2 
-cr , 


(4.1) 


which depends on the pairwise tree correlation p, the individual tree variance a 2 and 
the number of trees B. While the second term vanishes for large B the first term stays 
unchanged. 

The idea behind the random forest algorithm is to reduce the correlation p without increas¬ 
ing the variance a 2 . This is achieved by a random selection of the input variables during 
the tree-growing process. In the following we will concentrate on a bootstrapped dataset. 
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For bootstrapping the dataset Z, containing N candidates described by p variables, is di¬ 
vided into B datasets. For the division we draw from Z randomly with replacement B 
datasets Z\,Z B containing the same number of events N as the original dataset Z. 

The general algorithm of the training of a random forest is shown in algorithm |4.1| 


Algorithm 4.1 Training of a random forest ll39l 


1. for b = 1 to B: 

(a) Draw a bootstrap sample Z\, of size N from the training data 

(b) Grow a tree T& to Zb, by recursively repeating the following for each terminal 
node, until minimum node size ri nn n (number of candidates) is reached 

i. select m variables at random from the p variables 

ii. pick the best variable/split-point among the m 

iii. split the node into two daughter nodes 

2. output the ensemble of trees {Tb} f. 


The split is done by selecting the variable most suitable to discriminate signal from back¬ 
ground, and dividing the dataset into two hemispheres. Subsequently each hemisphere is 
handed over to one of the two daughter nodes. Finally, each node l represents a subregion 
Ri, with N[ observations of the whole input data set. The proportion of class k in this 
region is thus given by 

P ik = 1 = k )> (4 - 2) 

where Xi denotes the candidate to be classified and y, the class of x t . The function /(y, = 
k) returns 1 if y ?; — k and 0 otherwise. For final nodes all observations in the node are 
classified as the class with the highest value for p tk . 

In the application phase each tree returns a classification prediction Cb(x). The resulting 
classification prediction from the random forest is then the majority vote of C k (x), with 
b=l...B. 

The relevant tuning variables here are the number of trees B, the minimal node size n min 
and the number of randomly to choose variables m. 

4.2 Datasets 

Data taking on the T(4S) resonance at BaBAR took place in six run periods between 
February 2000 and August 2007. The data set used in this analysis comprises the complete 
data collected by BaBAR on the T(4S') resonance, with an integrated luminosity of 

£onPeak = (424 000 ± 1900 syst ) pb- 1 . (4.3) 

This corresponds to a total number of BB pairs of 

N bS = 470 960 000 ± 116 000 stat ± 2 826 000 syst . 


(4.4) 
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Table 4.1: Monte Carlo modes used for background studies. Given are their SP number 
in BaBAR bookkeeping and the contained number of events. 


mode 

SP number 

number of events 

e + e~ -A qq, q — u, d, s 

998 

2190 254 000 

e + e~ -A cc 

1005 

1127360 000 

e + e~ -A B + B~ 

1235 

707282 000 

e + e“ -a B°B° 

1237 

716 219 000 


In addition to the OnPeak data set various Monte Carlo modes have been used to study 
background behavior and for the training of the random forest. The used background 
modes are listed in Table 14. ll 


For the signal description we chose two different approaches, a simple phase space model, 
and a simulation reproducing the threshold enhancement seen in various baryonic B- 
decays (see sect. |2.2| ). For the first one, the Phase Space (PS) model, we simulate a 
/i- meson decaying directly into the final state particles yl+, p, £~ and Vg. Here, the four- 
momenta are distributed uniformly in phase space. This model is intended for studies of 
the systematic uncertainties arising from the chosen decay model. For the second model, 
the Weak Interaction (WI) model, we assume that the hadronic form factor is given by the 
decay B -A A^pn~. For this purpose we defined a meson pole M, decaying into the 
baryon anti-baryon pair A+p. 


B -A Mi vi 
M -AA+p 
At -a pK~TT H 


(4.5) 


Mass and width of the M are chosen to reflect the properties of the enhancement known 
from other decay modes, especially from B~ -A A~ p7r~ [[2l. The mass is chosen in such 
a way that the mass peak is slightly above threshold, at 3.225 GeV/c 2 and the width is 
derived from the m(A+p) distribution in 0 (Fig. 4.1) to be 200MeV/c 2 . The semi- 
leptonic decay B~ -A Mi~Vg is modeled according to the weak matrix element 


\M \ 2 = ( PbPu)(pePm ), 


(4.6) 


under the assumption that the spectator system collapses into one particle. The subsequent 
decay M -a A+p is simulated according to phase space. 


The WI model is expected to give a more realistic momentum and energy spectrum for 
the charged lepton and the neutrino as well as a more realistic m(A+p) distribution. Fig. 
4.2[ 4.3 show a comparison of energy and momentum for both models on generator level 


in the center-of-mass frame. For the invariant A+p mass a comparison on generator level, 
as shown in Fig. 4.4, shows a peak above threshold for the Weak Interaction model, while 
the Phase Space model shows a broad spectrum. 


All signal modes, generated using EvtGen [f40ll for the decay simulation and GEANT 
Pm for modelling the detector response, are given in Table |4.2[ 
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4.4 4.6 


m A _(GeV/c-) 


Figure 4.1: Invariant A+p mass for the decay B —> A+pn , divided by the expectation 
from a phase space model. 




(a) (b) 




(c) (d) 


Figure 4.2: Comparison of the charged lepton (left) and neutrino (right) energy for the 
PS and WI model, scaled to the same integral. The first row shows the electron, and the 
last row the muon channel. 
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(a) (b) 




(c) (d) 

Figure 4.3: Comparison of the charged lepton (left) and neutrino (right) momentum for 
the PS and WI model, scaled to the same integral. The first row shows the electron, and 
the last row the muon channel. 




(a) (b) 

Figure 4.4: Comparison of the invariant A c p mass for the PS and WI model, scaled to 
the same height. Fig. |(a)| shows the electron, and|(b)|the muon channel. 
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Table 4.2: Monte Carlo modes used for signal studies and the number of generated events. 
The upper half shows the direct decay of the B into the final state, described by a simple 
phase space model. The lower half gives the modes reproducing the threshold enhance¬ 
ment. 


mode 

number of signal events 

B~ —> A+pe~V e 

4180 000 

B~ ->■ A+pp-Vf, 

4 220 000 

B° —> A+p7r + e-V e 

4 230 000 

B° —» A+pn + p~V^ 

4 260 000 

B~ —* Me~v e , M —> A+p 

2 867841 

B~ —> M —> A+p 

3 098 301 



5. Event reconstruction 


Subject of the present work is the study of the two baryonic 5-decay modes B~ —y 
A+pe~V e and B~ —>■ yl+p/r _ P M . The reconstruction of the B candidate is done in four 
separate steps. In the first step a yl+ candidate is reconstructed which is used in the second 
step to reconstruct the visible part of the B decay, namely A~pY. henceforth denoted as 
Y system. 


B —» A+pl u ( (5.1) 

Y 

The third step is the reconstruction of the neutrino as missing energy and momentum in 
the event. In the last step the reconstructed neutrino is combined with the Y to form a B 
candidate. 


5.1 A+ reconstruction 

The yl+ is reconstructed in its dominant decay mode yl+ —>• pK~ir + , which has a branch¬ 
ing fraction of (5.0 ± 1.3)% f7). The p, K~ and 7r + candidates are combined and fitted 
to a common vertex to form a /1+ candidate. A ,4+ candidate is accepted if the vertex 
fit with the TreeFitter algorithm is successful and the invariant mass of the candi¬ 
date is within the interval from 2.235 to 2.332 GeV/c 2 . The particle identification (PID) 
requirements for the three input particles are listed in Table|5.1} 


particle 

PID list 

p 

pCombinedSuperLoose 

K- 

KCombinedSuperLoose 

7T + 

piCombinedSuperLoose 


Table 5.1: PID lists used for the At reconstruction. 







40 


5. Event reconstruction 



Figure 5.1: Fitted /!+ mass distribution for OnPeak data. 


Parameter 

fitted value 

AW 

5348820 ± 2969 

N sig 

237499 ± 1924 

p 

2.285446 ± 0.000029 

a 

0.004489 ± 0.000034 

bi 

-0.033696 ± 0.00076 

b 2 

-0.059378 ± 0.00095 


Table 5.2: Fit parameters for the fit to the A+ mass in OnPeak data. 


5.1.1 yl+ mass constraint 

In order to improve the resolution of the reconstructed events a mass constraint on the 


nominal /l ( f mass is applied prior to the Y reconstruction (see section 5.2). The standard 


mass value for the constraint is the one used for the Monte Carlo production which is 
2.2849 GeV/c 2 . We use this value in the reconstruction of Monte Carlo events. For data a 
precise measurement of the A+ mass shows a momentum dependence of the mass mean 
value as well as a bias introduced by the SVT material density used in the reconstruction 
@21. To determine the optimal mass value for the constraint we perform an extended 
maximum-likelihood fit to the m{pK~ -k + ) distribution in data. As fit function we use a 
second order Chebychev polynomial for background and a Gaussian for the signal de¬ 
scription. The fit can be seen in Fig. 5.1 , while the fit parameters are given in Table |5.2j 
We decide to constrain the A+ mass in data to 2.2854 GeV/c 2 . 


5.2 Reconstruction of the visible decay products 

For the reconstruction of the Y system the previously reconstructed /1+ candidate is com¬ 
bined with an p, and an e~ or p~ candidate. The particle identification at this stage is 
based on the PID lists given in Table |5.3[ To make sure that the lepton lies within the 


acceptance region of the SVT, DCH and EMC (excluding the not well calibrated part 
of the EMC in the backward region) the polar angle 0\ of the lepton has to be within 
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0.41... 2.37 rad. The resulting Y candidate is discarded if a fit of the daughters to a com¬ 
mon vertex is not successful. To select those Y candidates with a four-momentum p Y 
consistent with a B~ —* Yu —* decay we use the angle between the B meson and 

the Y candidate Oby (defined in the center-of-mass system). 

In semileptonic B decays the four-momentum of the neutrino can be expressed as 

pi = 0 = {pb - PY ) 2 = Mg + My - 2 (E b E y - \pb\\py\ cos 9 by ). (5.2) 


Here, Eg and \p B \ can be derived from the center-of-mass energy. For M B the world 
average value can be used. Under the assumption that we have a perfectly reconstructed 
semileptonic decay, we can determine cosd BY 


COS 6 B Y = 


2 E b E y - M 2 B - Ml 
^\Pb\\py\ 


(5.3) 


To retain only physical values of cos 0 BY and to suppress background from wrongly re¬ 
constructed Y candidates we require | cos 6 BY < 1.2, allowing for resolution effects in 
the reconstruction of this quantity. A comparison of OnPeak data and the WI signal Monte 
Carlo is shown in Fig. |5.2| For light hadronic final states we would expect a strong en¬ 


hancement at positive values for signal events, while background events (which make up 
most of the OnPeak data) should show no such peak. This strong difference is smeared 
out when considering a heavy final state like in the case of B~ —> A~lp£~i 7 where the 
B direction is dominated by the A~p system. In consequence the peak for signal Monte 
Carlo is broadened, while background events tend to prefer values near +1 as well. 


Table 5.3: Particle ID lists used for the Y reconstruction. 


particle 

PID list 

P 

pCombinedSuperLoose 


eCombinedLoose 

p- 

muCombinedVeryLooseFakeRate 


5.3 Neutrino reconstruction 

Under the assumption that there are no undetected particles in the event the neutrino can 
be reconstructed as missing energy and missing momentum of the event. In an e + e - 
experiment like BaBAR the energy and momentum of the CM system are well known for 
each interaction. To determine the missing energy and momentum in principle all seen 
particles (E^pA have to be subtracted from the CM system ( E C m, Pcm )• For neutral 
particles, mainly photons, the GoodPhotonLoose list is used. For charged particles 
the ChargedTracks list has to be used since the higher quality list GoodTracks- 
VeryLoose does not contain neutral particles that are converted into charged particles 
inside the detector, e.g. 7 —* e+e~, or K° s -A h + ti~ . 

(.H'viPv) ( E m [ ss , Pmiss) (F/cmjPgm) ( ^ ^ Ej , l ^ pi ) 


(5.4) 
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5. Event reconstruction 




Figure 5.2: Comparison of cos 8by in OnPeak data and WI signal Monte Carlo for the 
electron [(a)] and the muon [(b)] channel. For both plots a subset of all reconstructed candi¬ 
dates was used, requiring the m^s, AE and m(A +) cuts described later on. 


The major problem is to assign the correct particle hypothesis to the charged tracks. In the 
ChargedTracks list the default hypothesis for each track is the pion hypothesis, i.e. 
the mass for each track is set to the pion mass. In order to circumvent this potential bias 
we assign more realistic particle hypotheses to the tracks for the neutrino reconstruction. 
Therefore each element of ChargedTracks is compared in a well defined order with 
stringent particle identification lists, given in Table |5.4 If the track is not used inside the 
yl+ or Y the according particle hypothesis is applied to the track, otherwise the same hy¬ 
pothesis as in the /1+ or V has to be used. The comparison with the particle identification 
lists is done in the following order: 


1. if track £ electron list —>• electron PID is assigned 

2. else if track £ kaon list —» kaon PID is assigned 

3. else if track £ muon list —>■ muon PID is assigned 

4. else if track £ proton list —> proton PID is assigned 

5. else pion PID is assigned. 


Table 5.4: Particle ID lists used for the neutrino reconstruction. 


particle type 

PID list 

electron 

eKMTight 

kaon 

KKMTight 

muon 

muBDTVeryLoose 

proton 

pKMTight 
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m ES (GeV/c 2 ) m Eg (GeV/c 2 ) 


(a) B A/ pe v e Signal Monte Carlo (b) B —> A/ p/i Signal Monte Carlo 

Figure 5.3: The AE-tues plane in WI signal Monte Carlo for the two B~ —> A/pYui- 
signal modes. 


5.4 B reconstruction 

Subsequently the reconstructed Y and v candidate are combined to a B candidate. Kine¬ 
matic consistency of the candidate with a B decay is checked using two variables, the 
beam energy substituted mass, m E s, and the difference between the reconstructed and 
expected energy of the candidate, A E. They are defined as 

A E = E* b - y/s/2 

{s/2 + Pb ■ Pbeams ) 2 -3 

P 2 PBi 

^ beams 

where s refers to the total energy squared of the CM system, p B and Pbeams to the mo¬ 
mentum of the B and the e + e“ in the laboratory frame and E* B to the energy of the B 
candidate in the CM frame. For the sake of readability we set c = h = 1. 

The resulting B candidate has to pass loose cuts on m ES and A E. 

—2.0 GeV < A E < 2.0 GeV and m E s > 5.0 GeV/c 2 (5.7) 

The resulting two-dimensional AE-tues plane in WI signal Monte Carlo (via the JET SET 
model) for the two signal modes can be seen in Figure |5.3[ A clear signal peak is visible, 
making a measurement of these decay channels plausible if the branching fraction is rea¬ 
sonably large. Furthermore, the distributions show that the signal has large tails in m E s 
as well as in A E. 



(5.5) 

(5.6) 
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5. Event reconstruction 



6. Background suppression 


For a good separation of signal and background a detailed background study is necessary. 
Therefore, we identified two different classes of background. 

1 . continuum events from e + e _ —» qq (q = u, d , s, c) 

2. combinatorial background from other B decays 


2 . 0 . 

1.5l 

1.0 


> 

O 


: 


H 

< -0.5l 


-i.sl 


~ 2 S 




(a) 


(b) 


Figure 6.1: The AE-m E s plane for uds (a) and cc (b) Monte Carlo. 



mss (GeV/c ! ) m K (GeV/o 2 ) 


(a) (b) 


Figure 6.2: The AE-m E s plane for B + B (a) 


and B°B° 


(b) 


Monte Carlo. 
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6. Background suppression 


The two-dimensional AE-tues distribution for these two background classes after the 
decay reconstruction can be found in Figures |6.1| and 6.2 In comparison to the signal 
Monte Carlo distribution shown in Figure [53] these two background classes are smeared 
out over a large phase space region. In addition, the uds background shown in Fig. 6.1(a) 
is shifted to smaller tties values. Note, that the concentration of background events close 
the the signal region is caused by the large invariant mass of the Y system. 


6.1 At mass cut 


In order to suppress background from generic p, K~ and n + combinations in the /l ( f 
reconstruction we decide to use a ±3 o mass selection region around the central mass 
value of the A+. Due to the observed difference between OnPeak data and Monte Carlo 
events the cut has to be determined separately for both data sets. Therefore, we fit the 
m(A +) distribution in data and signal Monte Carlo with a Gaussian g(rri: p, o) for signal 
and a second order Chebychev polynomial p(m\ b\, b 2 ) for background. The fit results are 
given in Fig. |6.3[|6.4| and Table |6.1[ and |6.2 Due to the high statistics of the signal Monte 
Carlo sample a better fit would require additional terms for signal description. But for the 
purpose of a mass cut the shown description suffices. 



Figure 6.3: m(A +) fit in OnPeak data. 



Figure 6.4: m(A+) fit in B —» A+pe u e 
MC events. 


Table 6.1: Fit parameter for a fit to m(A +) Table 6.2: Fit parameter for a fit to m(A+) 

in data events. in Monte Carlo events. 


parameter 

value 

Nbkg 

749530 ± 1120 

N sig 

39835 ± 739 

p 

2.285453 ± 0.000068 

o 

0.004578 ± 0.000081 

h 

-0.04754 ± 0.0020 

t>2 

-0.06730 ± 0.0026 


parameter 

value 

Nbhg 

334006 ± 744 

N sig 

217413 ± 661 

p 

2.284760 ± 0.000012 

o 

0.004099 ±0.000013 

bi 

-0.06399 ± 0.0030 

b 2 

-0.21995 ±0.0038 


The fit returns a a of 4.58±0.09 MeV/c 2 for data and 4.10±0.02 MeV/c 2 for Monte Carlo. 
The larger value for data events is plausible taking into account the observed momentum 
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dependence of the mean value of the /1+ mass for data li42ll . Since we use 7l+ candidates 
from a relatively large momentum range this effect leads to a broader distribution. 

For convenience we decide to use the same width for the cut for both, OnPeak data and 
Monte Carlo, only shifted according to the different mean values, measured in section 


2.2716 < m{A+) < 2.2992 GeV/c 2 , (6.1) 

corresponding to ±3cr, and for Monte Carlo 

2.2711 < m(A+) < 2.2987 GeV/c 2 , (6.2) 


5.1.1 Therefore, we decide to use the following cut on the 7l+ candidate mass for data 


corresponding to ±3.4cr. 

6.2 Particle identification 

For the reconstruction of our signal candidates we use the PID lists used in the BaBAR 
internal data reprocessing. While the CombinedSuperLoose PID lists provide a high 
probability to correctly identify the detected tracks they have a high mis-identification 
probability as well. In order to reduce the number of mis-identified particles we decide to 
use the more stringent PID lists, given in Table |6.3| 

Table 6.3: Particle ID lists used for the signal extraction. 


particle 

PID list 

proton (yl+) 

pKMLoose 

kaon (7l+) 

KKMLoose 

pion (71+) 

piKMLoose 

proton 

pKMTight 

electron 

eKMTight 

muon 

muBDTTight 


6.3 Continuum background suppression 

For the reduction of continuum background ( qq , q = u, d. s. c) we decide to use a random 
forest. The number of split variables is set to v / p , where p is the total number of discrim¬ 
inating variables. The training data sets for signal and background equal each other in 
size. 

For continuum background we settle on three event shape variables, defined as: 


• The ratio of the second and zeroth Fox-Wolfram moment R 2 11431 for all charged 
tracks, defined as 


R 2 


Y!ij acks \Pi\\Pi\P2{ 

E tracks \ \ \ \ 

ij IPzIlPjl 


(6.3) 


with the Legendre polynomial P 2 and the angle 9 tJ between the momenta p, and p r 











48 


6. Background suppression 


• The sphericity S of the event li43l . derived from the sphericity tensor 


oaf) 


EiPtPi 

Ei \Pi \ 2 


where a,/3 = 1,2,3 correspond to the x, y and z components. 


(6.4) 


• The cosine of the angle between the thrust axis of the Y candidate and the thrust 
axis ll43l of the rest of the event cos A6Wust- The thrust axis is defined as the unit 
vector ft that maximizes the thrust T 


T 


max 

|n| = l 


y^tracfcs 


n-Pi 


E i I Pi 


(6.5) 


A comparison of these three variables between signal and background for the electron 


channel is given in Fig. 6.5 and for the muon channel in the appendix, Fig. A.l 




R, S 

(a) (b) 



(c) 


Figure 6.5: Comparison of WI signal Monte Carlo (blue histogram) and qq background 
Monte Carlo (green and red lines) for the three Random Forest input variables, |(a)|-R 2 , 

S, and [(c)] cos A9 { 


(b) 


7 thrust • 


The random forest (RF) has two major variables to be adjusted for optimal performance, 
the number of trees N trees and the minimum number of events per leaf N\ ea f. We optimise 
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these two variables by training random forests with both variables ranging from 10 to 
1000 and calculating the area A below the receiver operating characteristic (ROC) curve. 
A large value here points to a good performance. A plot of the area in dependence of 
Agrees and TVieaf can be seen in Fig. 6.6 The optimum for the RF configuration is at 380 
trees and 60 events per leaf, but as can be seen from Fig. 6.6 the variation in A is not very 
large (except close to N trees = 0), and in consequence the performance deficits by using 
a not optimal configuration are negligible. 



Figure 6.6: Area below the ROC curve in dependence of N trees and N\ 


leaf • 


For continuum background we decide to use the optimal set of N tiees = 380 and A) ea f = 
60 for the electron and the muon cha nnel. The classifier distribution for signal and back¬ 
ground after training is shown in Fig. 6.7 Maximizing S/\JS + B, with the signal yield 
S scaled to an expected branching fraction of 10” 4 and the background yield B (including 
BB background) scaled to data luminosity, returns an optimal cut of > 0.4 for the electron 
case, preserving 88% of the signal, while reducing continuum background to 41%. For 
the muon channel we obtain an optimal cut of > 0.4 as well, preserving 87% of the signal 
and reducing background to 42%. A comparison of signal and background efficiency in 


dependence of the classifier cut is shown in Fig. 6.8 
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6. Background suppression 



elf 


(a) 


(b) 


Figure 6.7 

random forest: 


The classifier distributions for signal and background after training of the 

(a)l-E> - —>■ A+pe~V e W^B~ -t A+pp-V^. 





(a) 


(b) 


Figure 6.8: Signal (red, solid line) and background (blue, dashed line) efficiencies in 
dependence of the cut on the random forest classifier: (a) B~ —» A+pe~ 

A+pp- 


(b) B~ ->■ 


v 
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6.4 B background suppression 

In the next step after continuum background reduction we have to reduce generic BB 
background as well. Therefore, we train an additional random forest, based on four input 
variables: 


• The lepton momentum in the CM system, 

• the angle between the lepton momentum and the momentum of the A+p system in 
the CM frame, 

• the vertex probability for the Y candidate, and 

• the transverse neutrino momentum in the CM system. 


A comparison of WI signal Monte Carlo and generic B Monte Carlo data is shown in Fig. 


6.9 for the electron and in the appendix (Fig. A.2) for muons. For the vertex probability 


signal as well as background contain events with a vertex probability of zero. But, the 
fraction of these candidates in the background is much larger. As shown in the inset 
the number of events with a probability larger 0.1 stays nearly constant for the signal 
simulation, while it converges to zero for background events. As for the qq background 






P x 2(Y) |pJJ (GeV/c) 

(c) (d) 


Figure 6.9: Comparison of WI signal simulation events and generic B decays: (a) \p' 
(b)lcos 0 Ma mP,4Y), andffdjW,,. 
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6. Background suppression 


we optimize the BB random forest as well. For performance reasons we decide to vary 
the number of trees between 100 and 500 and the number of events per leaf between 10 
and 200. The resulting distribution of the area below the ROC curve can be seen in Fig. 
6.10| The optimal RF performance is achieved with 450 trees and a minimum number 


of 40 events per leaf. As for the qq RF the variation of the area below the ROC curve is 
rather small, for N trees » 100. The classifier distributions for signal and background are 


shown in Fig. 6.11 



Figure 6.10: Area below the ROC curve in dependence of N trees and N\ ea ,{. 


6.4.1 BB Classifier as fit variable 


A possible fit variable is the Random Forest output for the BB background itself. This 
variable offers by construction a good separation of signal and background, as can be 
seen in Fig. |6.12 In addition the agreement between the background Monte Carlo and 
the data distribution within the uncertainties is reasonable for low values, although it 
seems to underestimate background a little. For large values the background Monte Carlo 
simulation overestimates data background. Another problem arises when it comes to find 
a functional description of the signal distribution. A closer look at large classifier ( elf ) 


values, as shown in Fig. 6.13 for the electron channel, shows a shoulder on the right hand 
side of the signal peak, as well as some structure on the left side. This would require a 
rather complex fit function, which strongly depends on the specific decay model. Thus, 
we decide not to use the classifier as a fit variable, but rather select candidates above a 
certain threshold in the classifier. 
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Figure 6.11: The classifier distribution s for signal and background after training of the 

—* A+pe~ 


BB random forest: 


(a) 


B~ 


Ve. 


(b) B —s- A+pp, v 
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Figure 6.12: Comparison of background Monte Carlo (blue squares), OnPeak data 
(green dots) and WI signal Monte Carlo (red diamonds) for the BB Random Forest clas¬ 
sifier. 
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(b) 


Figure 6.13: The BB classifier distribution for the WI electron (a) and muon [(b)] signal 
Monte Carlo. The blue squares show the WI model, and the green dots the phase space 
model. Shown is the range from 0.8 to 1.0. 
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6. Background suppression 



7. Analysis 


For the further analysis of the decay we select events above a certain threshold in the BB 
classifier. The optimal value of this threshold is determined by maximizing the statistical 
significance S/y/S + B, where S is the expected signal yield, and B the background 
yield. Here, we use all events that passed the previously described selection criteria. 
For the background yield we scale the qq and BB Monte Carlo data sets to OnPeak 
luminosity. For the signa l yie ld we vary the expected branching fraction between 0.5 x 
10 -5 and 5 x 10 -4 . Fig. |7.l| shows S/\/S + B versus the classifier cut for both signal 
channels. The maximal value for the electron channel is achieved for elf > 0.9, reducing 
background to 2% while preserving 36% of signal. The maximum for the muon channel 
is at > 0.85, reducing background to 3% while preserving 42% of signal. Both optimal 
cuts show no significant variation with the expected signal yield. 




(a) (b) 


Figure 7.1: S/\/S + B versus the BB classifier for the electron (a) 


and muon 


(b) 


channel. 


Figure |T2| shows a comparison of background and signal efficiency in dependence of the 
classifier cut for both signal channels. 
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7 . Analysis 




(a) 


(b) 


Figure 7.2: Signal (red, solid line) and background (blue, dashed line) efficiencies in 


dependence of the cut on the random forest classifier, (a) B —>■ Afpe v e , (b) B —y 
A+p^-v^. 


7.1 Fit variable 


For signal extraction we have two possible sets of variables to determine the signal yield, 
first the classic neutrino variables 


© TT7 ^ ^ _ 7 ) 

,/6 miss ^miss T^miss 

# ^miss/^-^miss 

• ml = (E* eam ^,E* Y y * 


where E hea _ m , E Y and p Y are measured in the CM system. For ml we neglect the small 
momentum of the B meson in the CM system, in consequence the neutrino momentum 
and the momentum of the Y system have the same magnitude. The second set consists of 


• A E 

• rn ES , 


as defined in eq. (5.5) and (5.6). 


Crucial for both variable sets is a good description of the background shape by back¬ 
ground Monte Carlo, and different shapes for background and signal. The next sections 
show a comparison of background Monte Carlo with OnPeak data and with WI signal 
Monte Carlo to decide on the variables for signal extraction. If the background descrip¬ 
tion from generic Monte Carlo is capable of describing data background we expect a good 
agreement between background Monte Carlo events and OnPeak data, with a possible 
variation due to a signal component in data. 









7.1. Fit variable 


51 


7.1.1 Neutrino Variables 


All three neutrino variables work well for low-mass final states like B —» nIv, but might 
have problems with high-mass final states, as it can be seen in the analysis of B~ —* 
D+K~£~T7 £ [44]. In this analysis the signal yield was extracted in m 2 njss , but in contrast 
to decays l ik e B —> niu background in this variable is not flat. In consequence, the signal 
peak had to be fitted onto a steep slope. The situation might be even more difficult in the 
present analysis, where the final state is about 0.8 GeV/c 2 heavier. 


A comparison between background Monte Carlo events, OnPeak data, and WI signal 
Monte Carlo data for these three variables is shown in Fig. 
background and signal Monte Carlo events for m 2 liss and m 2 n 


7.3 


The comparison of 
/ 2 E m iss shows that the 


distributions equal each other. This can be explained in terms of the large invariant mass 
of the Y system, leaving only a small fraction of energy and momentum for the neutrino, 
in consequence a neutrino signature can be easily faked by a wrong particle identification 
in the neutrino reconstruction, or a missing photon. 


In contrast m 2 is independent of the particle identification in the neutrino reconstruction 
and offers a small difference between background and signal. In addition the agreement 
between the OnPeak and the background Monte Carlo distribution is reasonably good, 
making m 2 a possible fit variable. The down-side of m 2 as fit variable is that a fit in this 
quantity requires to fit the signal on the right slope of the background distribution. 




Figure 7.3: Comparison of background Monte Carlo (blue squares), OnPeak data (green 


dots) and WI signal Monte Carlo (red diamonds) for the three neutrino variables [(a)] 


mi 


(b) rn~ miss /2E miss , and (c) 


mi 



































58 


7 . Analysis 


7.1.2 B Variables 


Two other possible target variables are ttt.es and A E for the reconstructed B meson. Fig. 


7.4 shows a comparison between data and Monte Carlo for these two variables. Com¬ 


paring the tttes distributions we see a good separation between background and signal, 
as well as a reasonable agreement between the background Monte Carlo and data distri¬ 
butions. This makes m E s a good choice as fit variable. In contrast the A E distribution 
shows a rather similar shape for background and signal Monte Carlo events. 



m Eg (GeV/c 2 ) AE (GeV) 


(a) 


(b) 


Figure 7.4: Comparison of background Monte Carlo (blue squares), OnPeak data (green 
dots) and WI signal Monte Carlo (red diamonds) for[(a)]m E s, and|(b)| AC. 


7.2 Signal extraction 


For signal extraction we decide to perform a simultaneous maximum likelihood fit in 
tttes and ml. Therefore, it is crucial to exclude a strong correlation between these two 


variables. Fig. 7.5 shows no correlation between these two variables, neither for signal 
nor for background. To exclude the possibility that the shape in one variable varies in 
dependence of the second variable, we compare the projections in slices of the second 


variable as well. Fig. |C.1 1 and [C2j show these projections for WI signal and background 
Monte Carlo. There seems to be no strong variation in shape parameters for neither signal, 
nor background. Thus, we can apply a factorization ansatz. In the following sections we 
determine the parameters of the background and signal probability density functions (pdf) 
by fitting Monte Carlo events. 


7.2.1 Signal parametrization 

For signal description we decide to use for ml a Cruijff function C(x: m 0 , cr L , ctr, oil, cur) 
defined as 


C(x; m 0 , <Jl, &r, oil, cxr) 


exp 


— (x—mo) 2 


2-a 2 R +a R -(x-mo) 2 


if x — m 0 > 0 


exp 


— ( x—mp ) 2 


2-a 2 L +ot L -(x-mo) 2 


if x — ttt 0 < 0, 


(7.1) 
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Figure 7.5: ml : m E s plane for WI signal Monte Carlo (a)|(b) and generic background 
Monte Carlo data 1(e)] , |(d)| For the electron channel |(a)||(c)| and for the muon channel |(b)| 


(d) 


and the sum of an ARGUS function A(x ; c, p, m 0 ) and a Novosibirsk function A f(x; x 0 , o, A) 
for m ES . 


S(x;c,p,m 0 ,x 0 ,(T,A,f 1 ) = ■ A(x;c,p,) + (1 - /i) - A f(x;x 0 ,a,A) (7.2) 


The ARGUS function A(x; c, p, m 0 ) is defined as 


A{x] c,p ) — x ■ 




(7.3) 


where m 0 denotes the upper boundary which coincides with half the center of mass energy 
in BaBAR, and can thus be taken from data as a conditional observable. The parameters p 
and c determine the shape of the ARGUS function. In the definition ll451 first used by the 
ARGUS collaboration p is fixed to 0.5. To reduce the number of free parameters we will 
do the same here. 

The Novosibirsk distribution J\f(x; x 0 , a, A) is defined as 


A F(x; xq, a, A) = exp 


1 Qngy ) 2 

2 A 2 



q y 


i A(x-x q) 


sinh 



Ay/\nA 


(7.4) 

(7.5) 


o 
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where xq represents the peak position, o the width of the peak, and A the asymmetric tail. 
For the fit only correctly reconstructed events are considered. Thus, we can determine the 
signal parameters, without a pollution from background events. 


The fitted ml distributions are shown in Fig. 7.6 and the fit parameters in Table |7.lj For 
the mss At the fitted signal distributions are shown in Fig. |7.7| and the fit parameters in 
Tab. O 




(a) (b) 


Figure 7.6: Fitted rn- distribution for WI signal Monte Carlo data: (a) for B —> 
A+pe~V e , and (b) for B~ —* 


Table 7.1: Parameters for the fit to the ml distribution for WI signal Monte Carlo events: 

for B~ —> A+pe¬ 


te) 


Vp 


and 


(b) for B —» A+pp v 


A 4 ' 


(a) 


parameter value 


m 0 

-0.035 

± 

0.005 

(7h 

0.326 

± 

0.004 


0.316 

± 

0.004 

OIL 

-0.023 

± 

0.005 

OtR 

-0.044 

± 

0.006 


(b) 


parameter value 


m 0 

-0.086 

± 

0.004 

(7l 

0.314 

± 

0.004 

ctr 

0.310 

± 

0.004 

OIL 

-0.035 

± 

0.006 

OiR 

-0.039 

± 

0.006 


In order to assess the quality of the fit we can check the binned two-dimensional pull p 

_ fi(ml, m E s) - m ES ) tn ^ 

Pi j V ' -OJ 

where iVj(m^,m E s) is the number of entries in the ?-th bin, and a the uncertainty on 
N t . The pull distributions are shown in Fig. |7.8| The pull plots show a systematic 
deviation at large m E s values and rn'l « 0.4 GeV 2 /c 4 . This deviation is not accounted 
for in the simultaneous fit, since an unbinned two-dimensional fit without correlations 
equals a simultaneous fit in the rn'l ar| fi m ES distribution. As for a simultaneous fit only 
the agreement in the projections is relevant, which is good. The pull plots are only shown 
for completeness. 
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(a) (b) 


Figure 7.7: Fitted rn es distribution for WI signal Monte Carlo data: (a) for B —> 

A+pe~v e , and (b) for B~ —* A+pp^v^. 


Table 7.2: Parameters for the fit to the m es distribution for WI signal Monte Carlo events, 
(a) for B~ —>■ A+pe~V e , and (b) for B~ —> A^pg~V^. 


(a) 


parameter 

value 

c 

-24.9 ± 0.6 

h 

0.514 ± 0.020 

x 0 

5.27930 ± 0.00015 

o 

0.0144 ± 0.0004 

A 

1.32 ±0.05 


(b) 


parameter 

value 

c 

-24.5 ± 0.6 

fi 

0.542 ± 0.021 

x 0 

5.27950 ± 0.00016 

o 

0.0142 ± 0.0005 

A 

1.34 ±0.05 


> 

1) 

o 



mj (GeV 2 /c 4 ) 



m 2 (GeV 2 /c 4 ) 


(a) 


(b) 


Figure 7.8: Pull plots for the fit to WI signal Monte Carlo data: (a) for B —* A^pe v e , 
and[(b)]for B~ —> A^pp~V^. 
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7.2.2 Background parametrization 


For background we use a Cruijff function for m 2 as well, and for rriEs an ARGUS func¬ 
tion. For the ARGUS we leave the exponent p floating to allow for a steeper distribution. 
The fitted projections are shown in Fig. |7.9| and |7.10[ and the parameters are given in Tab. 
7.3 1 and[7~4| The input data set for the fit consists of a luminosity scaled mixture of qq and 


BB Monte Carlo events, i.e. the relative contributions of qq and BB events to the com¬ 
plete input set are equivalent to the expected ratios in OnPeak data. The corresponding 
pull plots, shown in Fig. |7.1 1[ show no systematic deviations between the fitted function 
and the background Monte Carlo data. 




(a) (b) 


Figure 7.9: 


A+pe- 


and 


Fitted m 2 distribution for background Monte Carlo data: (a) for B —> 
(b) for B~ —> A+pp~v^. 


Table 7.3: Fit parameters for the fit to the m 2 distribution for background Monte Carlo 
events: 


(a) for B —>■ A+pe u e , and (b) for B —>■ A+pp v 


Al¬ 


ia) 


parameter 

value 

m 0 

-0.26 ± 0.04 

PL 

0.271 ± 0.030 

Or 

0.44 ± 0.04 

OIL 

0.01 ±0.04 

&R 

-0.10 ±0.08 


(b) 


parameter 

value 

m 0 

-0.268 ± 0.025 


0.283 ± 0.022 

or 

0.469 ± 0.029 

OIL 

0.007 ± 0.028 

OCR 

-0.13 ±0.04 
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Table 7.4: Fit parameters for the fit to the m es distribution for background Monte Carlo 
(a)jfor B~ —» A+pe~v e , and[(b)]for B~ —> A+pgTV^ 


events: 


m- 


(a) 


parameter 

value 

c 

-16.5 ± 1.1 

p 

0.45 ± 0.08 


(b) 

parameter 

value 

c 

-15.7 ±0.07 

p 

0.3976 ± 0.0010 




(a) 


(b) 


Figure 7.10: Fitted tubs distribution for background Monte Carlo data, (a) for B —> 
A+pe~V e , and (b) for B~ —» /l+p/r _ z7 M . 



Figure 7.11: 


A+pe 


and 


Pull plots for the fit to background Monte Carlo data: (a) for B —» 
(b) for B~ —* Tl+p/r - ^. 
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7.2.3 Fit validation 


In order to validate the fit procedure, and exclude any significant systematic uncertainties 
in the extracted signal yield, we prepare different mixtures of background and signal 
Monte Carlo data. While the used background set stays the same we vary the signal 
fraction from 0 to 100 events and compare the fitted signal yields with the size of the 
input data sets. Table 7.5| listing the fit result, shows no significant deviations between 
the size of the input data set and the fitted signal yield. Thus, we conclude that the fit 
procedure does not lead to a significant over- or underestimation of the signal yield and is 
save to use for signal extraction. The fits are shown in Fig. C.3 and C.4| 


Table 7.5: Size of the signal input data set, and the fitted signal yield for the electron and 
muon channel. 


signal events 

e fit result 

p fit result 

100 

80 ±40 

100 ± 70 

50 

30 ±40 

60 ±70 

10 

10 ±40 

20 ±70 

0 

-1 ±40 

30 ±70 


7.2.4 Fit to data 


We extract the signal yield for both signal channels on data separately. Therefore, all 
parameters of the background and signal pdfs are fixed to their optimal values, determined 
in section |7.2.1| and |7.2.2[ The fitted distributions and the pull distribution are shown in 


Fig. |7. 12 7.13 and 7.14 The obtained signal yields are 


N sig (B~ A+pe~v e ) = 5 ± 27, (7.7) 

N sig (B~ -)■ A+pp-V^) = -30 ± 50. (7.8) 

Although the yield for B~ —> A^pp~u IJi is negative both yield are compatible with zero. 


7.2.5 Statistical Upper Limit 

The significance for both decay channels is well below the threshold for an observation, 
and hence we can only give an upper limit for their branching fraction. The pure statistical 
upper limit is obtained using a Bayesian approach by integrating the likelihood obtained 
in the previous section from 0 to N, where N denotes the number of signal events at 
which the integral coincides with 90% of the integral from 0 to oo. The obtained upper 
limits for the signal yield at 90% confidence level are 


N sig (B ->■ A+pe u e ) < 73 
N s i s (B~ A+pp~V^) < 100. 


The projection of the likelihood onto the signal yield N s[g is shown in Fig. 


7.15 


(7.9) 

(7.10) 
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(a) 


(b) 


Figure 7.12: Fitted ml and m ES distributions of the fit to OnPeak data for the electron 
channel. 




(a) (b) 


Figure 7.13: Fitted nil and m E s distributions of the fit to OnPeak data for the muon 
channel. 


7.3 Efficiency calculation 


We determine the reconstruction efficiency by fitting WI signal Monte Carlo with the 
signal and background pdfs described before. Note, that in the determination of the signal 
pdf only Monte Carlo events with a correctly reconstructed signal decay were used. The 
fitted rn'l and m E s distributions, as well as the two-dimensional pull plot are shown in Fig. 


7.16 and 7.17 As expected only a small background fraction is contained in signal Monte 


Carlo, and the agreement between Monte Carlo data and the fit function is adequate, as 
shown in the pull plot. From the fit we extract a signal yield of 136000 ± 600 for the 
electron channel, and 127800 ± 600 for the muon channel. Together with the number of 
generated events (given in Table |4.2[) we obtain reconstruction efficiencies of 


e(B~ ->■ A+pe~i7 e ) = (4.7422 ± 0.0021)% 
e(B~ -)• A+pp-v^) = (4.1248 ± 0.0020)%. 


(7.11) 
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'll 



pq 

a 



Figure 7.14: The two-dimensional pull of the fit to OnPeak data for the electron [(a)] and 
muon [(b)] channel. 




(a) (b) 

Figure 7.15: Log-likelihood projections on N sig in OnPeak data, for the electron [(a)] and 
muon [(b)] channel. The dashed line shows the 90% CL limit. 



Figure 7.16: Fitted ml and mss distributions, as well as the two-dimensional pull of the 
fit to WI signal Monte Carlo data for the electron channel. The dashed line represents the 
signal contribution. 
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Figure 7.17: Fitted ml and m^s distributions, as well as the two-dimensional pull of the 
fit to WI signal Monte Carlo data for the muon channel. The dashed line represents the 
signal contribution. 


7.4 Systematic Uncertainties 

Since the previously determined signal yields are compatible with 0 the use of relative 
systematic uncertainties is not possible. Instead, we have to distinguish two types of sys¬ 
tematic uncertainties and treat them separately. The first type are systematic uncertainties 
affecting dominantly the signal yield in OnPeak data, but represent a second order effect 
on the efficiency, like the used fit function. The second type of systematic uncertainties 
affect the reconstruction efficiency. 

7.4.1 Fit function 

The used fit function influences mainly the signal yield obtained in the fit to experimental 
data. On the efficiency the effect is expected to be small due to the negligible background 
contribution. Thus, we concentrate on OnPeak data for this systematic. Therefore, we 
decide to change the signal description in ml from a Cruijff to a Gaussian and for mss 
we leave the ARGUS shape parameter c floating. The parameters of the Gaussian are 

H = -0.0477 ±0.0010 
a = 0.3065 ± 0.0007 


for the electron, and 


/i = -0.0899 ±0.0010 
a = 0.2963 ± 0.0007 

for the muon channel. Fixing these parameters to the obtained values, while leaving c 
floating we obtain 


c = -30 ± 12 
N s i s = 10 ± 26 


c = -13±15 
N= -20 ± 50 


for the electron, and 
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for the muon channel. Integrating the log-likelihood as before we obtain as upper limits 
for the signal yield 


N sig (B ->• A+pe u e ) < 81 (7.12) 

N sig (B~ -)■ A+pfruJ < 130. (7.13) 


We decide to use these values for the upper limit, rather than those obtained in section 


7.2.5 as a conservative limit. 


7.4.2 B counting, tracking and particle identification 


Three standard sources of systematic uncertainties are the B counting, the charged track 
reconstruction and the particle identification. The uncertainty from the B counting can 
be directly taken from the number of BB pairs, given in eq. ( |4.4[ ). and accounts for an 
uncertainty of 0. 


The tracking uncertainty is given by the tracking group [f46l1 . and amounts to a relative 
uncertainty of 0.128% per track for the used tracking list Charged Tracks. Adding 
up these uncertainties in quadrature leads to an overall tracking uncertainty of 0.3%. 


For the uncertainty from particle identification we take advantage of the correction ta¬ 
bles applied to Monte Carlo events. These correction tables try to balance the differences 
between data and Monte Carlo events. A standard method to obtain the systematic un¬ 
certainties is to turn off these correction tables and compare the number of reconstructed 
events with each other. The difference in these numbers can be taken as systematic un¬ 
certainty from particle identification. With the correction tables enabled, we obtain signal 
yields of 


N sig (B~ ->■ A+pe~i 7 e ) = 136000 ± 600, (7.14) 

N sig (B~ -)■ A+pfTl 7 m ) = 127800 ± 600 (7.15) 

on WI signal Monte Carlo, as described in section [773] for the efficiency calculation. Re¬ 
peating this fit on WI signal Monte Carlo events without the correction tables return signal 
yields of 


N sig {B~ A+pe~V e ) = 141800 ± 600, (7.16) 

N sig (B~ -)■ A+pprv^) = 131500 ± 700. (7.17) 

The differences in the signal yield lead to systematic uncertainties of 4.3% for the electron 
and 2.9% for the muon channel. These uncertainties are mainly due to the use of the 
Tight particle ID lists in the B and the neutrino reconstruction. While the influence on 
the B reconstruction directly affect m3 since it depends on the energy of the Y system, 
the neutrino reconstruction affects the loose A E selection cut. For the latter upper limit 
we have to neglect this uncertainty, since it leads to a larger efficiency, and in turn would 
reduce the upper limit. 

7.4.3 Model dependence 

For the model dependence we have two possibilities, either we determine the efficiency 
for phase space Monte Carlo, or for a WI Monte Carlo with altered properties of the pole 
at threshold. 
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Using phase-space Monte Carlo instead of our WI model for efficiency calculation we 
obtain signal yields of 115300 ± 600 for the electron and 93100 ± 600 for the muon 
channel. This translates into efficiencies of 2.8% and 2.2%, respectively. A comparison 
with the efficiencies given in section |73| this translates into systematic uncertainties of 


Umode\(B A+pe u e ) — 41% (7.18) 

Wmodel(£ - -A A+piTV^) = 47%. (7.19) 


The fits are shown in Fig. |7. 18 



(a) 



(b) 


Figure 7.18: m 2 and m ES projections as well as the two-dimensional pull distribution for 
the fit to Monte Carlo events distributed according to phase space. l(a)]for the electron and 


(b) for the muon channel. 


For the second option we re-weight WI signal Monte Carlo to a threshold enhancement 
at the same peak position (3.225 GeV/c 2 ), but with a doubled width of 400MeV/c 2 . A 
comparison of the 200 MeV/c 2 and 400 MeV/c 2 broad m(A+p) enhancement on generator 
level is shown in Fig. |7.19 The reweighted number of generated events is 2797130 for 
the electron and 3018900 for the muon channel. To determine the analysis efficiency for 


these reweighted events we repeat the fit described in section 7.3 Here, we obtain a signal 
yield of 121600 ± 500 for the electron channel, and 1306300 ± 600 for the muon channel. 
This translates into the efficiencies 


ew(B —» A+pe u e ) = 4.34% 
e w (B~ —» A+pprVn) = 4.32%. 


(7.20) 

(7.21) 


Taking the difference to the non-weighted efficiencies, given in sect. 7.3 
uncertainties we obtain as model-dependent systematic uncertainties 


as systematic 


u mo de\(B A+pe u e ) = 8.4% 
Wmodei(£ - ->■ A+piTV^) = 4.9%. 


(7.22) 

(7.23) 
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Figure 7.19: Invariant m(A+p) mass for WI signal Monte Carlo on generator level. The 
black dots are the original enhancement with a width of 200 MeV/c 2 , while the blue trian¬ 
gles show the reweighted events with a width of 400 MeV/c 2 . 


The fitted m 2 and mss projections as well as the two-dimensional pull plots are shown 
in Fig. 7.20 As can be seen from the distributions the signal pdf describes the weighted 
signal well. 


In conclusion we have two sets of systematic uncertainties to choose from. The most con¬ 
servative comes from phase space Monte Carlo, since this represents an absolute border 
of the model dependence. The re-weighted Monte Carlo can be seen as more realistic, 
but at the cost of an unknown confidence level for this uncertainty. In order to preserve a 
90% confidence level we decide to use the systematic uncertainties determined via phase 
space Monte Carlo events. 


7.5 Upper Limit 


With the upper limit for the signal yield, determined in sect. 7.4.1| we can determine the 
upper limit for the product branching fraction B(B~ —>■ A^p^~Ve) ■ B(A+ —>• pK~TT + ) as 


N S i g 
e ' N B b 


(7.24) 


with the efficiencies e, given in eq. ( |7.11[ ), and the number of BB pairs, given in eq. ( |4.4[ ). 
This leads to 


B{B- -A A+pe-i 7 e ) ■ B(A+ pK~n+) < 3.6 x 10~ 6 

B(B~ -a A+pp~V^) ■ B(A+ -> pK~ 7T+) < 6.7 x HT 6 , ’ 

neglecting the small statistical uncertainties on the efficiency and number of BB pairs. 
These values for the upper limit are calculated at 90% confidence level, but without the 
systematic uncertainties affecting the efficiency. Here, we only have to consider the model 
uncertainties which reduce the efficiency, and lead in turn to a larger upper limit. This 
uncertainty amounts to 41% for the electron, and 47% for the muon channel. Due to 
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(a) 



(b) 


Figure 7.20: ml and m^s projections as well as the two-dimensional pull distribution 
for the fit to reweighted Monte Carlo events, (a) for the electron and |(b)| for the muon 
channel. 


these large uncertainties we have to work with asymmetric error ranges and modify the 
efficiency with a factor 1 — u e of 0.59 for the electron, and 0.53 for the muon case. 

B{B- -p A+pe~V e ) ■ B(At pA'~7r + ) < 6.1 x 10' 6 
B{B~ ->■ A+pp~ z/ M ) • B(A+ ->■ pK~ 7T+) < 12.6 x 10“ 6 . 


Correcting for the A+ branching fraction, by dividing by the world average value of 5%, 
given in Q, we obtain 


B(B —> A+pe v e ) ■ 


B(A+ —> pK 7r + ) 
5% 


B{B ->• A+pp Vp) 


B(A+ —> pK 7r + ) 
5% 


< 1.2 x 10~ 4 

< 2.5 x 10“ 4 . 


(7.27) 


We decide to use this notation, to allow for a later correction of the upper limit for a 
changed /1+ branching fraction, since the current value has a large model-dependent un¬ 
certainty. If we would take the recent Belle measurement on the /1+ branching fraction 

03 , 

B(A+ ->• pK~tt + ) = (6.84 ± 0.241°;^)% (7.28) 

the upper limits would be 


B(B A+pe v e ) ■ 
B(B~ A+pp-p^) ■ 


B(A+ —> pK 7r + ) 
6.84% 

£>(%+ —> pK~ 7T + ) 
6.84% 


< 0.9 x 10' 4 

< 1.8 x 10" 4 . 


(7.29) 


For the combined upper limit for B —> A~p(i ut we perform a simultaneous, unbinned 
maximum likelihood fit in m^s an d ml for the electron and the muon channel. The 
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(a) (b) 




(c) 


(d) 


Figure 7.21: Fit projections for the electron (Kan |(b)| ) and muon ( |(c)[ |(d)[ ) channel from 
the simultaneous fit. The dashed line shows the signal component. 


common variable is the efficiency corrected signal yield. For the fit we use the same 
functions as in section 7.4.1| where we determined the systematic uncertainty arising 
from the used fit function. Here, we fix all parameters to the obtained values in section 
|7.4.1| For the efficiency correction of the signal yields we use the values given in eq. 
( |7.11[ ). The m E s and ml projections for both decay modes are shown in Fig. 7.21 The 


likelihood in dependence of the signal yield is shown in Fig. 7.22 The Bayesian upper 


limit for the signal yield at 90% confidence level is 1279 events. With the number of BB 
pairs, given in eq. (0, this translates into 

B[B~ ->• A+p£~V t ) ■ B(A+ -► pK~ tt+) < 2.7 x 10~ 6 . (7.30) 


For the systematic uncertainty arising from the model dependence of the efficiency we 
take the average of the correction factors given in the previous section, which is 0.56, and 
obtain after the correction for the A+ branching fraction an upper limit of 

B(B~ -A A+pTv t ) • B ^ ?r+ - > < 1.0 x 10“ 4 , (7.31) 

o /o 


for the world average value of 5% for B(A+ —> pK tt + ), and for the recent Belle result 
of 6.84% an upper limit of 


B(B~ -A A+p£~u e ) ■ 


B(A+ —> pK 7r + ) 
6.84% 


< 0.7 x 10“ 4 . 


(7.32) 
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Figure 7.22: Likelihood for the simultaneous fit for the electron and the muon channel. 
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8. Conclusion 


The present analysis is a search for the semi-leptonic B decay B~ —» A^pfBVp. Neither 
for the electron, nor the muon mode a signal could be established. This led to the 90% 
CL upper limits of 


B(B —>• A+pe u e ) 
B{B~ ->• A+ppTV^) 
B(B~ -P A+prv e ) 


B(A+ —> pK 7r + ) 
5% 

B(A+ —> pK~ 7r + ) 
5% 

B(A+ —> pK~ 7r + ) 
5% 


< i.2 x nr 4 , 

< 2.5 x 10“ 4 , 

< 1.0 x 10“ 4 . 


( 8 . 1 ) 


All three limits are in a slight tension with two of the predictions given in sect. 2.3, which 
are of the order 5 x 10~ 4 . There are three possible explanations for this tension: 


1. An unknown suppression factor reduces this branching fraction. 

2. The predictions in sect. |2.3| rely on inaccurate assumptions. 

3. The decay model used in this analysis for signal simulation is not accurate. 


Although the first one is the most interesting one, it is the most unlikely explanation as 
well. For the second explanation we have to take a closer look on the two predictions 
again. The first one is derived from the branching fraction for B~ —> ppd ~by assuming 
a simple CKM suppression between the two decays. This ignores completely the different 
size of the available phase space, as well as different form factors for the fragmentation 
into the hadronic part of the final state. These form factors can depend on the invariant 
mass of the final state particles, and thus might affect the prediction substantially. The 
second prediction comes from the decay B~ —> A+pir~, assuming that 100% of this 
decay proceeds via the external W Feynman graph, which is probably not accurate. A 
possible solution for this problem comes from the isospin analysis given in ll20ll . where 
the contribution of the external graph is determined based on simple isospin arguments. 
The upper limit we obtained here is compatible with the predicted limit in ll20l and might 
corroborate the isospin argumentation. Last but not least, we have to consider the used 
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Monte Carlo model in this analysis. We modelled the semi-leptonic decay according to 
the weak matrix element, assuming a meson-like pole near threshold decaying into A+p. 
Despite being well motivated from the analysis of full hadronic baryonic 5-decays this 
pole model might prove to be wrong. Another factor here might be the quark hadron du¬ 
ality, since the weak matrix element is only valid on quark-level, neglecting the influence 
of the hadronization. 


On the experimental side the analysis suffered from the high mass of the final state par¬ 
ticles, rendering the classical signal variables used in semi-leptonic 5 decays useless, 
as was shown in sect. |7.1.1[ A similar problem was already observed in the analysis 


of 5' —>■ D+K~l~Vn [44]. To circumvent this problem, a tagged analysis approach, 
where the second B is fully reconstructed in a pure hadronic mode, is advisable. Belle II, 
which will have a multitude of the BaBAR dataset available for analysis, should be able to 
perform a tagged search for the decay 5” -A A^p£~u e . LHCb should be able to study 
this decay mode as well. The large boost and the resulting displaced vertex could enable 


them to obtain a rather clean signal for 5 
reconstruction. 


-A A~p/i ip, without the need for a neutrino 


The measurement of the semileptonic decay 5' -A A+pClsp stays crucial for the under¬ 
standing of baryonic 5-decays. A direct observation will enable us to quantify the impact 
of the external Feynman graph on the overall branching fraction of fully hadronic bary¬ 
onic 5 decays. In addition, it will provide valuable insight on the threshold enhancement 
described in sect. |2.2[ especially on the absence of such an enhancement in the decay 
5° -A B c (2455)°pn~. Consequently it will help to decide if the model given in [|T6l is 
correct. 





A. Random Forest input variables 


A.l qq random forest 




(a) (b) 



(c) 


Figure A.l: Comparison of WI signal Monte Carlo (blue histogram) and qq background 
Monte Carlo (green and red lines) for the three Random Forest input variables, |(a)|7? 2 - 
S, and [(c)] cos A6* t 


(b) 


'thrust • 
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A. Random Forest input variables 


A.2 BB random forest 




|p; I (GeV/c) U>s8, 

(a) (b) 



(C) 



KJ (GeV/c) 

(d) 


Figure A.2: Comparison of WI signal simulation events and generic B decays, (a) |p||, 
(b)lcos 9 AcP ^,KB\P x 2(Y), andl^lpri/. 

















































B. Muon Channel target variables 



b° b° 
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B+BT 


cc 

cl 
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p 1 B~ ->A +P/1 - tip 
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(a) 



(b) 



Figure B.l: Comparison of WI signal Monte Carlo and generic background Monte Carlo 
for (a) Ml, (b) Ml/E„, (c) M^ iss , anc|(d)| AE. The background Monte Carlo is scaled to 


OnPeak luminosity and stacked. The WI Monte Carlo is scaled to match the height of 
the background peak. 
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B. Muon Channel target variables 



C. Fit procedure 


C.l Correlation checks 




(a) 
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Figure C.l: tties and ml in slices of each other. On the left hand side for electron, and 
on the right hand side for muon WI signal Monte Carlo events. 
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C. Fit procedure 



m BS (GeV/c 2 ) m ES (GeV/c 2 ) 


(a) 


(b) 




(c) (d) 

Figure C.2: m ES and ml in slices of each other. On the left hand side for electron, and 
on the right hand side for muon background Monte Carlo events. 
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C.2 Fit validation 



(a) 100 signal events (b) 100 signal events 




(c) 50 signal events 


(d) 50 signal events 



(e) 10 signal events (f) 10 signal events 




(g) 0 signal events 


(h) 0 signal events 


Figure C.3: ml and m^s projections for the fit validation for the electron channel. Shown 
are the fits for 100, 50,10 and 0 input signal events. 
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C. Fit procedure 




(c) 50 signal events 




(e) 10 signal events 




(g) 0 signal events (h) 0 signal events 


Figure C.4: ml and /ties projections for the fit validation for the muon channel. Shown 
are the fits for 100,50,10 and 0 input signal events. 
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